Virtual Screening of Drug Likeness using Tree Based Ensemble Classifier
Ani R1, Roshini Manohar1, Gayathri Anil1 and O.S. Deepa2

1Department of Computer Science and Applications, Amrita Vishwa Vidyapeetham, Amritapuri, India.

2Department of Mathematics, Amrita Vishwa Vidyapeetham, Coimbatore, India.

Corresponding Author E-mail: anir@am.amrita.edu

Abstract: In earlier years, the Drug discovery process took years to identify and process a Drug. It takes a normal of 12 years for a Drug to travel from the research lab to the patient. With the introduction of Machine Learning in Drug discovery, the whole process turned out to be simple. The utilization of computational tools in the early stages of Drug development has expanded in recent decades. A computational procedure carried out in Drug discovery process is Virtual Screening (VS).  VS are used to identify the compounds which can bind to a Drug target. The preliminary process before analyzing the bonding of ligand and drug protein target is the prediction of drug likeness of compounds. The main objective of this study is to predict Drug likeness properties of Drug compounds based on molecular descriptor information using Tree based ensembles. In this study, many classification algorithms are analyzed and the accuracy for the prediction of drug likeness is calculated. The study shows that accuracy of rotation forest outperforms the accuracy of other classification algorithms in the prediction of drug likeness of chemical compounds. The measured accuracies of the Rotation Forest, Random Forest, Support Vector Machines, KNN, Decision Tree and Naïve Bayes are 98%, 97%, 94.8%, 92.8%, 91.4%, 89.5% respectively.

Keywords: Classification Algorithms; Drug likeness; Ensemble Methods; Lipinski’s Rule; Molecular Descriptors; Virtual Screening

[ HTML Full Text]

Back to TOC