Evaluation of different machine learning methods for ligand-based virtual screening

Kurczab, R; Smusz, S; Bojarski, AJ

doi:10.1186/1758-2946-3-S1-P41

Volume 3 Supplement 1

6th German Conference on Chemoinformatics, GCC 2010

Poster presentation
Open access
Published: 19 April 2011

Evaluation of different machine learning methods for ligand-based virtual screening

R Kurczab¹,
S Smusz¹ &
AJ Bojarski¹

Journal of Cheminformatics volume 3, Article number: P41 (2011) Cite this article

2322 Accesses
6 Citations
Metrics details

In silico High Throughput Screening of large compound databases has become increasingly popular technology of finding valuable drug candidates, by applying a wide range of computational methods, such as machine learning [1]. In recent years, many comparative studies of different machine learning methods performance in ligand-based virtual screening have been reported [2, 3].

In order to extend these studies, we have evaluated over 60 different machine learning methods, such as: support vector machines (with and without parameter optimization), naïve Bayesian, decision trees, random forest, meta-classifiers (boosting, bagging, grading) and many others. All calculations were performed using a collection of machine learning algorithms for data mining implemented in WEKA package [4]. Additionally, for each of the method, we have examined the influence of different type of fingerprints, the size of training sets and attribute selection methods on the rate of active recall and precision of selection. Our internal database of known 5-HT7 antagonists has been used to build training and testing sets.

It was found that there is no machine learning approach that consistently provides the best results but some of them are very stable and can be applied universally.

References

Melvile J, Burke E, Hirst J: Machine Learning in Virtual Screening. Comb. Chem. High Throughput Screening. 2009, 12: 332-343. 10.2174/138620709788167980.
Article Google Scholar
Plewczynski D, Spieser S, Koch U: Performance of machine learning methods for ligand-based virtual screening. Comb Chem High Throughput Screening. 2009, 12: 358-368. 10.2174/138620709788167962.
Article CAS Google Scholar
Ma X, Jia J, Zhu F, Xue Y, Li Z, Chen Y: Comparative analysis of machine learning methods in ligand-based virtual screening of large compound libraries. Comb Chem High Throughput Screening. 2009, 12: 344-357. 10.2174/138620709788167944.
Article CAS Google Scholar
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten I: The WEKA Data Mining Software. SIGKDD Explorations. 2009, 11: 10-18. 10.1145/1656274.1656278.
Article Google Scholar

Download references

Acknowledgements

The study was partly supported by a grant PNRF-103-AI-1/07 from Norway through the Norwegian Financial Mechanism.

Author information

Authors and Affiliations

Department of Medicinal Chemistry, Institute of Pharmacology Polish Academy of Sciences, Krakow, 31-343, Poland
R Kurczab, S Smusz & AJ Bojarski

Authors

R Kurczab
View author publications
You can also search for this author in PubMed Google Scholar
S Smusz
View author publications
You can also search for this author in PubMed Google Scholar
AJ Bojarski
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to R Kurczab.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Kurczab, R., Smusz, S. & Bojarski, A. Evaluation of different machine learning methods for ligand-based virtual screening. J Cheminform 3 (Suppl 1), P41 (2011). https://doi.org/10.1186/1758-2946-3-S1-P41

Download citation

Published: 19 April 2011
DOI: https://doi.org/10.1186/1758-2946-3-S1-P41

6th German Conference on Chemoinformatics, GCC 2010

Evaluation of different machine learning methods for ligand-based virtual screening

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

6th German Conference on Chemoinformatics, GCC 2010

Evaluation of different machine learning methods for ligand-based virtual screening

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us