A large scale classification of molecular fingerprints for the chemical space representation and SAR analysis

López-Vallejo, Fabian; Waddell, Jacob; Yongye, Austin B; Houghten, Richard A; Medina-Franco, José L

doi:10.1186/1758-2946-4-S1-P26

Volume 4 Supplement 1

7th German Conference on Chemoinformatics: 25 CIC-Workshop

Poster presentation
Open access
Published: 01 May 2012

A large scale classification of molecular fingerprints for the chemical space representation and SAR analysis

Fabian López-Vallejo¹,
Jacob Waddell¹,
Austin B Yongye¹,
Richard A Houghten¹ &
…
José L Medina-Franco¹

Journal of Cheminformatics volume 4, Article number: P26 (2012) Cite this article

1819 Accesses
1 Citations
Metrics details

Fingerprint-based structure representation has a broad range of applications including, but not limited to, diversity analysis, compound classification, chemical space visualization [1], activity landscape modelling and similarity searching. It has been shown that depending on the particular fingerprints used, the outcome of similarity searching [2] or activity landscapes [3] can be very different. Combining structure representations is a common practice to increase the performance of similarity searching [4]. Also, combining representations for activity landscape modelling has been proposed to generate robust descriptive SAR models [5]. However, the selection of fingerprints to be combined is not an easy task. As part of our efforts to select fingerprint representations to generate consensus representations of chemical space and activity landscapes [5, 6] herein we discuss the results of a systematic comparison of more than 10 2D and 3D fingerprint representations in terms of performance in diversity analysis (as opposed to similarity searching). We employed more than 20 data sets from different sources relevant to drug discovery. In this work the widely used Tanimoto coefficient was employed. The approach presented here can be easily extended to other similarity measures, additional fingerprints and molecular databases. We also discuss the typical mean/median similarity values of selected fingerprints across databases from different sources.

References

Medina-Franco JL, Martínez-Mayorga K, Giulianotti MA, Houghten RA, Pinilla C: Visualization of the Chemical Space in Drug Discovery. Curr Comput-Aided Drug Des. 2008, 4: 322-333. 10.2174/157340908786786010.
Article CAS Google Scholar
Bender A: How Similar Are Those Molecules after All? Use Two Descriptors and You Will Have Three Different Answers. Expert Opin Drug Discovery. 2010, 5: 1141-1151. 10.1517/17460441.2010.517832.
Article CAS Google Scholar
Wassermann AM, Wawer M, Bajorath J: Activity Landscape Representations for Structure-Activity Relationship Analysis. J Med Chem. 2010, 53: 8209-8223. 10.1021/jm100933w.
Article CAS Google Scholar
Chen B, Mueller C, Willett P: Combination Rules for Group Fusion in Similarity-Based Virtual Screening. Mol Inf. 2010, 29: 533-541. 10.1002/minf.201000050.
Article CAS Google Scholar
Yongye A, Byler K, Santos R, Martínez-Mayorga K, Maggiora GM, Medina-Franco JL: Consensus Models of Activity Landscapes with Multiple Chemical, Conformer and Property Representations. J Chem Inf Model. 2011, 51: 1259-1270. 10.1021/ci200081k.
Article CAS Google Scholar
López-Vallejo F, Nefzi A, Bender A, Owen JR, Nabney IT, Houghten RA, Medina-Franco JL: Increased Diversity of Libraries from Libraries: Chemoinformatic Analysis of Bis-Diazacyclic Libraries. Chem Biol Drug Des. 2011, 77: 328-342. 10.1111/j.1747-0285.2011.01100.x.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Torrey Pines Institute for Molecular Studies, Port St. Lucie, Florida, 34987, USA
Fabian López-Vallejo, Jacob Waddell, Austin B Yongye, Richard A Houghten & José L Medina-Franco

Authors

Fabian López-Vallejo
View author publications
You can also search for this author in PubMed Google Scholar
Jacob Waddell
View author publications
You can also search for this author in PubMed Google Scholar
Austin B Yongye
View author publications
You can also search for this author in PubMed Google Scholar
Richard A Houghten
View author publications
You can also search for this author in PubMed Google Scholar
José L Medina-Franco
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

López-Vallejo, F., Waddell, J., Yongye, A.B. et al. A large scale classification of molecular fingerprints for the chemical space representation and SAR analysis. J Cheminform 4 (Suppl 1), P26 (2012). https://doi.org/10.1186/1758-2946-4-S1-P26

Download citation

Published: 01 May 2012
DOI: https://doi.org/10.1186/1758-2946-4-S1-P26

7th German Conference on Chemoinformatics: 25 CIC-Workshop

A large scale classification of molecular fingerprints for the chemical space representation and SAR analysis

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

7th German Conference on Chemoinformatics: 25 CIC-Workshop

A large scale classification of molecular fingerprints for the chemical space representation and SAR analysis

References

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us