Making the most of approximate maximum common substructure search

Englert, Péter; Kovács, Péter

doi:10.1186/1758-2946-6-S1-P29

Volume 6 Supplement 1

9th German Conference on Chemoinformatics

Poster presentation
Open access
Published: 11 March 2014

Making the most of approximate maximum common substructure search

Péter Englert^1,2 &
Péter Kovács^1,2

Journal of Cheminformatics volume 6, Article number: P29 (2014) Cite this article

1322 Accesses
1 Citations
Metrics details

The maximum common substructure (MCS) problem is of great importance in multiple aspects of chemoinformatics. It has diverse applications ranging from lead prediction to automated reaction mapping and visual alignment of similar compounds. Many different algorithms have been developed [1], both exact and approximate. Since the MCS problem is NP-complete, the strict time constraints of most applications can only be realistically satisfied by fast and robust approximation methods.

We developed two efficient heuristic algorithms. One is based on the popular approach of reducing the MCS problem to finding the maximum clique in the modular product of the two molecule graphs. The other is based on a new algorithm by Kawabata, called the build-up method [2]. We also incorporated other techniques, for example, the topological fingerprinting primarily used in substructure and similarity searching. We optimized our implementations for use in multiple applications developed at ChemAxon. In some applications, for example, hierarchical MCS-based clustering or similarity search in large databases, the algorithms are required to give close to optimal results in limited time. To meet these conflicting demands, our implementations were enhanced with strong heuristics. Upper bound calculation methods were also applied for screening and early termination purposes.

In other applications, for example, reaction mapping or visual alignment, the challenge is that topological features must also be taken into account. Apart from the size of the found common substructure, the determined one-to-one correspondence between the atoms of the molecules is also very important. Effective heuristics were developed to guide the algorithms to prefer those solutions in which the relative positions of the common fragments of the input molecules are as similar as possible.

Our implementations have been thoroughly tested and benchmarked. They have also been compared to publicly available solution methods, and integrated into different products at ChemAxon. This has shown that the presented MCS algorithms can adequately cover the conflicting requirements of typical applications. We present our methods and heuristics along with their effects on running time, memory usage, as well as the size and features of the result.

References

Raymond JW, Willett P: Maximum common subgraph isomorphism algorithms for the matching of chemical structures. Journal of Computer-Aided Molecular Design. 2002, 16: 521-533. 10.1023/A:1021271615909.
Article CAS Google Scholar
Kawabata T: Build-up algorithm for atomic correspondence between chemical structures. Journal of Chemical Information and Modeling. 2011, 51 (8): 1775-1787. 10.1021/ci2001023.
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Department of Algorithms and Applications, Eötvös Loránd University, Budapest, H-1117, Hungary
Péter Englert & Péter Kovács
ChemAxon Ltd., Budapest, H-1031, Hungary
Péter Englert & Péter Kovács

Authors

Péter Englert
View author publications
You can also search for this author in PubMed Google Scholar
Péter Kovács
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Péter Kovács.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Englert, P., Kovács, P. Making the most of approximate maximum common substructure search. J Cheminform 6 (Suppl 1), P29 (2014). https://doi.org/10.1186/1758-2946-6-S1-P29

Download citation

Published: 11 March 2014
DOI: https://doi.org/10.1186/1758-2946-6-S1-P29

9th German Conference on Chemoinformatics

Making the most of approximate maximum common substructure search

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

9th German Conference on Chemoinformatics

Making the most of approximate maximum common substructure search

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us