Skip to main content


  • Poster presentation
  • Open Access

INSARA: a new method for the analysis and visualization of Structure-Activity-Relationships

Journal of Cheminformatics20124 (Suppl 1) :P44

  • Published:


  • Lead Optimization
  • Drug Discovery Process
  • Pharmacophoric Feature
  • Ring Problem
  • Reduce Graph

Due to the rapid progress in combinatorial chemistry and (high-throughput-) screening, the organization and mining of the large amount of produced data becomes an increasingly important task in the modern drug discovery process. Herein, one particular challenge is the recognition of SAR-patterns e.g. for the selection of promising compounds for further analysis or lead optimization.

To support the medicinal chemist in doing this job a readily interpretable concept is required. Most published approaches addressing this problem (e.g. SARANEA [1]) use fingerprint similarity for the analysis of molecular relationships. Yet, a promising alternative and more intuitive way of comparing similarity is the maximum common substructure (MCS), the largest substructure in common between two molecules. Since computing the MCS is very demanding, it is usually not applicable to large data sets.

To circumvent this and other drawbacks (e.g. the exact match or incomplete ring problems) our own in-house strategy INSARA (intuitive networks for Structure-Activity-Relationships analysis) employs reduced graphs (RG) instead of the original molecules in order to reduce the complexity of the problem to a manageable size.

The advantage of RG is that only pharmacophoric features and functional units represented by a few pseudoatoms have to be compared [2]. Iterative super- and substructure searches and MCS calculations subsequently lead in an unsupervised manner to SAR networks. When associated with bioactivity data the networks can be used for SAR analysis. While focussing on pharmacophoric properties a more general overview about the similarities within the active set is expected. For initial performance evaluation two targets with well-known SARs (ACE and COX 2) were chosen.

Authors’ Affiliations

Institut für Medizinische und Pharmazeutische Chemie, Technische Universität Braunschweig, Beethovenstr. 55, D-38106 Braunschweig, Germany


  1. Lounkine E, Wawer M, Wassermann AM, Bajorath J: SARANEA: A Freely Available Program to Mine Structure-Activity and Structure-Selectivity Relationship Information in Compound Data Sets. J Chem Inf Model. 2010, 50: 68-78. 10.1021/ci900416a.View ArticleGoogle Scholar
  2. Gardiner EJ, Gillet VJ, Willett P, Cosgrove DA: Representing clusters using a maximum common edge substructure algorithm applied to reduced graphs and molecular graphs. J Chem Inf Model. 2007, 47: 354-366. 10.1021/ci600444g.View ArticleGoogle Scholar


© Wollenhaupt and Baumann; licensee BioMed Central Ltd. 2012

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.