Molecular bioactivity extrapolation to novel targets by support vector machines

Van Westen, Gerard JP; Wegner, JK; IJzerman, AP; Van Vlijmen, HWT; Bender, A

doi:10.1186/1758-2946-2-S1-O3

Volume 2 Supplement 1

5th German Conference on Cheminformatics: 23. CIC-Workshop

Oral presentation
Open access
Published: 04 May 2010

Molecular bioactivity extrapolation to novel targets by support vector machines

Gerard JP Van Westen¹,
JK Wegner²,
AP IJzerman¹,
HWT Van Vlijmen² &
…
A Bender¹

Journal of Cheminformatics volume 2, Article number: O3 (2010) Cite this article

2211 Accesses
1 Citations
Metrics details

The early phases of drug discovery use in silico models to rationalize structure activity relationships, and to predict the activity of novel compounds. However, the performance of these models is not always acceptable and the reliability of external predictions - both to novel compounds and to related protein targets - is often limited. Proteochemometric modeling [1] adds a target description, based on physicochemical properties of the binding site, to these models.

Our proteochemometric models [2] are based on Scitegic circular fingerprints on the compound side and on a customized protein fingerprint on the target side. This protein fingerprint is based on a selection of physicochemical descriptors obtained from the AAindex database. Through PCA we selected a number of physicochemical properties which are hashed in a fingerprint using the Scitegic hashing algorithm. We compared this fingerprint to a number of protein descriptors previously published, including the Z-scales, the FASGAI and the BLOSUM descriptors. Our fingerprint performs superior to all of these. In addition, we show that proteochemometric models improve external prediction capabilities. In the case of classification this leads to models with a higher specificity when compared to conventional QSAR. In the case of regression our models show an average lower RMSE of 0.12 log units when based on a pIC50 output variable compared to conventional QSAR modeling the same data-set. Furthermore, our models enable target extrapolation. As a result we can predict the activity of known and new compounds on new targets while retaining the same model quality as when performing external validation without target extrapolation.

References

Freyhult E, Prusis P, Lapinsh M, Wikberg JE, Moulton V, Gustafsson MG: Unbiased descriptor and parameter selection confirms the potential of proteochemometric modelling. BMC Bioinformatics. 2005, 6: 50-10.1186/1471-2105-6-50.
Article Google Scholar
Doddareddy MKR, Van Westen GJP, Horst Van der E, Peironcely JE, Corthals F, IJzerman AP, Emmerich M, Jenkins JL, Bender A: Chemogenomics: Looking at Biology through the Lens of Chemistry. Stat Anal Data Mining. 2009.
Google Scholar

Download references

Author information

Authors and Affiliations

Amsterdam Center for Drug Research, Einsteinweg 55, 2333 CC, Leiden, The Netherlands
Gerard JP Van Westen, AP IJzerman & A Bender
Tibotec, Gen De Wittelaan L 11B 3, 2800, Mechelen, Belgium
JK Wegner & HWT Van Vlijmen

Authors

Gerard JP Van Westen
View author publications
You can also search for this author in PubMed Google Scholar
JK Wegner
View author publications
You can also search for this author in PubMed Google Scholar
AP IJzerman
View author publications
You can also search for this author in PubMed Google Scholar
HWT Van Vlijmen
View author publications
You can also search for this author in PubMed Google Scholar
A Bender
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Gerard JP Van Westen.

Rights and permissions

Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License ( https://creativecommons.org/licenses/by-nc/2.0 ), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Reprints and permissions

About this article

Cite this article

Van Westen, G.J., Wegner, J., IJzerman, A. et al. Molecular bioactivity extrapolation to novel targets by support vector machines. J Cheminform 2 (Suppl 1), O3 (2010). https://doi.org/10.1186/1758-2946-2-S1-O3

Download citation

Published: 04 May 2010
DOI: https://doi.org/10.1186/1758-2946-2-S1-O3

5th German Conference on Cheminformatics: 23. CIC-Workshop

Molecular bioactivity extrapolation to novel targets by support vector machines

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

5th German Conference on Cheminformatics: 23. CIC-Workshop

Molecular bioactivity extrapolation to novel targets by support vector machines

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us