Efficient maximum common subgraph (MCS) searching of large chemical databases

Sayle, Roger A; Batista, Jose; Grant, J Andrew

doi:10.1186/1758-2946-5-S1-O15

Volume 5 Supplement 1

8th German Conference on Chemoinformatics: 26 CIC-Workshop

Oral presentation
Open access
Published: 22 March 2013


Journal of Cheminformatics volume 5, Article number: O15 (2013)


Despite dramatic improvements in the hardware resources and computational power available to pharmaceutical researchers over the past few decades, the methods used for assessing the 2D chemical similarity between two molecules have changed little since the 1960s. Here we report a novel chemical database search method that allows the exact size of the maximum common edge subgraph (MCES) between a query molecule and each molecule in a database to be calculated rapidly. Using a pre-computed index, the 50 nearest neighbors of a query can be determined in a few seconds, even for databases containing millions of compounds. This work builds upon the previous efforts of Wipke and Rogers in the 1980s [1] and of Messmer and Bunke in the 1990s [2], harnessing the advances in high-performance computing and storage technology now available. A graphical depiction of such a "SmallWorld" index is shown below.
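To make the quantity being searched concrete, the following is a minimal brute-force sketch of the MCES size between two small hydrogen-suppressed molecular graphs, followed by a nearest-neighbor ranking. It is not the indexed method reported here, which is designed to avoid this kind of pairwise computation; it only illustrates what an exact MCES size and a nearest-neighbor list mean. The function names, the graph encoding (a list of element symbols plus a dictionary of bond orders), the choice not to require a connected common subgraph, and the distance |E_a| + |E_b| - 2|MCES| are all assumptions made for illustration.

```python
def mces_size(atoms_a, bonds_a, atoms_b, bonds_b):
    """Exact MCES size (number of shared bonds) by exhaustive backtracking.

    atoms_*: list of element symbols, indexed by atom number.
    bonds_*: dict mapping (i, j) with i < j to a bond order.
    Exponential time: suitable only for toy molecules.
    """
    def bond(bonds, i, j):
        return bonds.get((min(i, j), max(i, j)))

    n = len(atoms_a)
    mapping = [None] * n   # mapping[i] = atom of B matched to atom i of A
    used = set()           # atoms of B already matched
    best = 0

    def extend(i, matched):
        nonlocal best
        if i == n:
            best = max(best, matched)
            return
        # Option 1: leave atom i of A unmatched.
        mapping[i] = None
        extend(i + 1, matched)
        # Option 2: match atom i to each unused atom of B with the same element.
        for j, symbol in enumerate(atoms_b):
            if j in used or symbol != atoms_a[i]:
                continue
            # Count bonds from i to earlier atoms that are preserved in B.
            gained = 0
            for h in range(i):
                if mapping[h] is None:
                    continue
                order = bond(bonds_a, h, i)
                if order is not None and order == bond(bonds_b, mapping[h], j):
                    gained += 1
            mapping[i] = j
            used.add(j)
            extend(i + 1, matched + gained)
            used.discard(j)
            mapping[i] = None

    extend(0, 0)
    return best


def nearest_neighbors(query, database, k):
    """Rank database molecules by an MCES-derived bond edit distance."""
    q_atoms, q_bonds = query
    scored = []
    for name, (atoms, bonds) in database.items():
        common = mces_size(q_atoms, q_bonds, atoms, bonds)
        distance = len(q_bonds) + len(bonds) - 2 * common
        scored.append((distance, name))
    return sorted(scored)[:k]


# Toy data: heavy atoms only, single bonds encoded as order 1.
propane = (["C", "C", "C"], {(0, 1): 1, (1, 2): 1})
database = {
    "ethanol": (["C", "C", "O"], {(0, 1): 1, (1, 2): 1}),
    "butane": (["C"] * 4, {(0, 1): 1, (1, 2): 1, (2, 3): 1}),
    "isobutane": (["C"] * 4, {(0, 1): 1, (0, 2): 1, (0, 3): 1}),
}

print(nearest_neighbors(propane, database, 2))
# prints [(1, 'butane'), (1, 'isobutane')]
```

In this toy example propane shares two bonds with both butane and isobutane but only one with ethanol, so the two C4 alkanes rank first. The exhaustive search scales exponentially with molecule size, which is the cost a pre-computed index is meant to remove.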

References

[1] Wipke WT, Rogers D: Rapid Subgraph Search using Parallelism. J Chem Inf Comput Sci. 1984, 24: 255-262. doi:10.1021/ci00044a012.
[2] Messmer BT, Bunke H: Subgraph Isomorphism Detection in Polynomial Time on Preprocessed Graphs. Proc Asian Conf on Computer Vision. 1995, 151-155.


Author information

Authors and Affiliations

NextMove Software Limited, Cambridge, Cambridgeshire, CB4 0EY, UK
Roger A Sayle
Discovery Sciences, AstraZeneca R&D, Alderley Park, Cheshire, SK10, UK
Jose Batista & J Andrew Grant


Corresponding author

Correspondence to Roger A Sayle.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.


About this article

Cite this article

Sayle, R.A., Batista, J. & Grant, J.A. Efficient maximum common subgraph (MCS) searching of large chemical databases. J Cheminform 5 (Suppl 1), O15 (2013). https://doi.org/10.1186/1758-2946-5-S1-O15


Published: 22 March 2013
DOI: https://doi.org/10.1186/1758-2946-5-S1-O15
