Graph kernels for chemoinformatics – a critical discussion

Rupp, Matthias

doi:10.1186/1758-2946-3-S1-O8

Volume 3 Supplement 1

6th German Conference on Chemoinformatics, GCC 2010

Oral presentation
Open access
Published: 19 April 2011

Graph kernels for chemoinformatics – a critical discussion

Matthias Rupp¹

Journal of Cheminformatics volume 3, Article number: O8 (2011) Cite this article

2934 Accesses
Metrics details

We analyze the use, advantages, and drawbacks of graph kernels in chemoin-formatics, including a comparison of kernel-based approaches with other methodology, as well as examples of applications.

Kernel-based machine learning [1], now widely applied in chemoinformatics, delivers state-of-the-art performance [2] in tasks like classification and regression. Molecular graph kernels [3] are a recent development where kernels are defined directly on the molecular structure graph. This allows the adaptation of methods from graph theory to structure graphs and their direct use with kernel learning algorithms. The main advantage of kernel learning, the so-called "kernel trick", allows for a systematic, computationally feasible, and often globally optimal search for non-linear patterns, as well as the direct use of non-numerical inputs such as strings and graphs. A drawback is that solutions are expressed indirectly in terms of similarity to training samples, and runtimes that are typically quadratic or cubic in the number of training samples.

Graph kernels [3] are positive semidefinite functions defined directly on graphs. The most important types are based on random walks, subgraph patterns, optimal assignments, and graphlets. Molecular structure graphs have strong properties that can be exploited [4], e.g., they are undirected, have no self-loops and no multiple edges, are connected (except for salts), annotated, often planar in the graph-theoretic sense, and their vertex degree is bounded by a small constant. In many applications, they are small. Many graph kernels are general-purpose, some are suitable for structure graphs, and a few have been explicitly designed for them.

We present three exemplary applications of the iterative similarity optimal assignment kernel [5], which was designed for the comparison of small structure graphs: The discovery of novel agonists of the peroxisome proliferator-activated receptor γ [6] (ligand-based virtual screening), the estimation of acid dissociation constants [7] (quantitative structure-property relationships), and molecular de novo design [8].

References

Hofmann T, Schölkopf B, Smola A: Kernel methods in machine learning. Ann Stat. 2008, 36 (6): 1171-1220. 10.1214/009053607000000677.
Article Google Scholar
Invanciuc O: Applications of Support Vector Machines in Chemistry. Reviews in Computational Chemistry 23. Edited by: Lipkowitz K, Cundari T. 2007, Weinheim: Wiley-VCH, 291-400. full_text.
Chapter Google Scholar
Rupp M, Schneider G: Graph kernels for molecular similarity. Mol Inf. 2010, 29 (4): 266-273.
Article CAS Google Scholar
Rupp M: Kernel Methods for Virtual Screening. PhD thesis. 2009
Google Scholar
Rupp M, Proschak E, Schneider G: Kernel approach to molecular similarity based on iterative graph similarity. J Chem Inf Model. 2007, 47 (6): 2280-2286. 10.1021/ci700274r.
Article CAS Google Scholar
Rupp M, Schroeter T, Steri R, Zettl H, Proschak E, Hansen K, Rau O, Schwarz O, Müller-Kuhrt L, Schubert-Zsilavecz M, Müller K-R, Schneider G: From machine learning to natural produt derivatives selectively activating transcription factor PPARg. ChemMedChem. 2010, 5 (2): 191-194. 10.1002/cmdc.200900469.
Article CAS Google Scholar
Rupp M, Körner R, Tetko I: 2010, submitted
Hartenfeller M, Schneider G: De novo drug design. Meth Mol Biol. 2011, 672: 299-323. full_text.
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Helmholtz Zentrum München, Deutsches Forschungszentrum für Umwelt & Gesundheit, Ingolstädter Landstr. 1, 85764, Neuherberg, Germany
Matthias Rupp

Authors

Matthias Rupp
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Matthias Rupp.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Rupp, M. Graph kernels for chemoinformatics – a critical discussion. J Cheminform 3 (Suppl 1), O8 (2011). https://doi.org/10.1186/1758-2946-3-S1-O8

Download citation

Published: 19 April 2011
DOI: https://doi.org/10.1186/1758-2946-3-S1-O8

6th German Conference on Chemoinformatics, GCC 2010

Graph kernels for chemoinformatics – a critical discussion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

6th German Conference on Chemoinformatics, GCC 2010

Graph kernels for chemoinformatics – a critical discussion

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us