This article builds conventional machine learning models in Python, such as Random Forest and Linear Regression, to predict the bioactivity values of molecules from the ChEMBL dataset.

What you will learn

  • How to use the ChEMBL Python API for data collection
  • Practical application of Lipinski's descriptors and PaDEL
  • Using the RDKit Python library

Installing the required libraries

pip install pandas
pip install numpy
pip install matplotlib
pip install scikit-learn
pip install chembl_webresource_client
pip install seaborn
## To install rdkit, use conda:
conda install -c conda-forge rdkit

In case you face any error, please visit this link.

Also, note that you must have Python version 2.7 - 3.6…
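Before moving on, it can help to confirm that the libraries are visible to the interpreter you plan to use. The snippet below is a minimal sketch, not part of the original workflow: the helper name `find_missing` and the `REQUIRED_MODULES` list are illustrative assumptions. It relies only on the standard library and checks import names, which can differ from install names (scikit-learn is imported as `sklearn`).

```python
import importlib.util
import sys

# Import names for the libraries installed above (assumed list; adjust as needed).
# Install names and import names can differ: scikit-learn is imported as sklearn.
REQUIRED_MODULES = [
    "pandas",
    "numpy",
    "matplotlib",
    "sklearn",
    "chembl_webresource_client",
    "seaborn",
    "rdkit",
]


def find_missing(modules):
    """Return the top-level module names that cannot be located in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Print the running interpreter version so it can be compared with the note above.
    print("Python version:", sys.version.split()[0])
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

If any module is reported missing, re-run the matching install command in the same environment (for rdkit, the conda environment) and check again.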

