Integrating logic-based machine learning and virtual screening to discover new drugs
Journal of Cheminformatics volume 4, Article number: O10 (2012)
Investigational Novel Drug Discovery by Example (INDDEx™) is a technology developed to guide hit to lead discovery by learning rules from existing active compounds that link activity to chemical substructure. INDDEx is based on Inductive Logic Programming , which learns easily interpretable qualitative logic rules from active ligands that give an insight into chemistry, relate molecular substructure to activity, and can be used to guide the next steps of drug design chemistry. Support Vector Machines weight the rules to produce a quantitative model of structure-activity relationships. Whereas earlier testing [2, 3] was performed on single dataset examples, this talk presents the largest and fullest test of the method. The method was benchmarked on the Directory of Useful Decoys (DUD) datasets , using the same methodology described in the paper on the assessment of LASSO  and DOCK. For each of the DUD datasets, the known active ligands were mixed with all the decoy compounds in DUD, and the retrieval rates of INDDEx and DUD were measured when they were trained on 2, 4, and 8 of the known active ligands (Figure 2). Early retrieved compounds showed high topological differences to molecules used as training data, showing the strength of this method for scaffold hopping. This work was supported by a BBSRC case studentship with Equinox Pharma Ltd (http://www.equinoxpharma.com).
Muggleton SH: Inductive logic programming. New Gen Comp. 1995, 13: 245-286. 10.1007/BF03037227.
Amini A, et al: A Novel Logic-Based Approach for Quantitative Toxicology Prediction. J Chem Inf Model. 2007, 47: 998-1006. 10.1021/ci600223d.
Cannon EO, et al: Support vector inductive logic programming outperforms the naive Bayes classifier and inductive logic programming for the classification of bioactive chemical compounds. J Comput Aided Mol Des. 2007, 21: 269-280. 10.1007/s10822-007-9113-3.
Huang N, et al: Benchmarking Sets for Molecular Docking. J Med Chem. 2006, 49: 6789-6801. 10.1021/jm0608356.
Reid D, et al: LASSO-ligand activity by surface similarity order: a new tool for ligand based virtual screening. J Comput Aided Mol Des. 2008, 22: 479-487. 10.1007/s10822-007-9164-5.
About this article
Cite this article
Reynolds, C.R., Sternberg, M.J. Integrating logic-based machine learning and virtual screening to discover new drugs. J Cheminform 4 (Suppl 1), O10 (2012). https://doi.org/10.1186/1758-2946-4-S1-O10
- Support Vector Machine
- Virtual Screening
- Logic Programming