Representation and searching of biomolecules

Durant, Joeseph L; Chen, WL; Christie, BD; Grier, DL; Leland, BA; Nourse, JG

doi:10.1186/1758-2946-2-S1-O4

Volume 2 Supplement 1

5th German Conference on Cheminformatics: 23. CIC-Workshop

Oral presentation
Open access
Published: 04 May 2010

Journal of Cheminformatics volume 2, Article number: O4 (2010)

Biomolecules present challenges to chemical information systems designed for small molecules. Their sizes, up to tens of thousands of atoms, overwhelm representation, storage, and searching solutions built on an explicit chemical representation of the structures. However, biomolecules are largely made up of many repeats of a limited number of building-block molecules, a fact that has been used to provide a compressed representation in which templates stand in for the building blocks.

We have adopted a modified template-based representation for biomolecules. Our primary interest is in the chemically modified portions of biomolecules, which we choose to represent with explicit chemistry. These regions of explicit chemistry are then embedded in the template-compressed, unmodified remainder of the full biomolecule.
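The hybrid layout described above can be sketched as a small data structure. This is a minimal illustration, not the authors' file format: the class names, the `stored_atoms` and `expanded_atoms` helpers, and the placeholder modified residue are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Dict, List, Union


@dataclass(frozen=True)
class Template:
    """A building block stored once and referenced by name (hypothetical layout)."""
    name: str
    atom_count: int  # heavy atoms in the residue as it appears in a chain


@dataclass(frozen=True)
class TemplateResidue:
    """An unmodified residue: only a reference to its template is stored."""
    template: str


@dataclass(frozen=True)
class ExplicitResidue:
    """A modified residue: atoms and bonds are stored in full."""
    atoms: tuple  # element symbols
    bonds: tuple  # (atom index, atom index, bond order)


Residue = Union[TemplateResidue, ExplicitResidue]


def stored_atoms(chain: List[Residue], templates: Dict[str, Template]) -> int:
    """Atoms written out: each used template once, plus every explicit atom."""
    used = {r.template for r in chain if isinstance(r, TemplateResidue)}
    explicit = sum(len(r.atoms) for r in chain if isinstance(r, ExplicitResidue))
    return sum(templates[name].atom_count for name in used) + explicit


def expanded_atoms(chain: List[Residue], templates: Dict[str, Template]) -> int:
    """Atoms in the fully expanded, all-explicit structure."""
    total = 0
    for r in chain:
        if isinstance(r, TemplateResidue):
            total += templates[r.template].atom_count
        else:
            total += len(r.atoms)
    return total


# Heavy-atom counts for glycine and alanine residues within a peptide chain.
templates = {"Gly": Template("Gly", 4), "Ala": Template("Ala", 5)}

# Placeholder atoms and bonds standing in for one chemically modified residue.
modified = ExplicitResidue(
    atoms=("N", "C", "C", "O", "C", "O"),
    bonds=((0, 1, 1), (1, 2, 1), (2, 3, 2), (1, 4, 1), (4, 5, 1)),
)

chain = [TemplateResidue("Gly")] * 50 + [TemplateResidue("Ala")] * 50 + [modified]

compact = stored_atoms(chain, templates)   # 4 + 5 + 6 = 15
full = expanded_atoms(chain, templates)    # 200 + 250 + 6 = 456
```

In this toy chain the unmodified residues cost only a template reference each, so the explicit atom count stays small even as the chain grows, while the modified residue remains available as ordinary atoms and bonds.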

The regions containing explicit chemistry are indexed, and thus can be structure-searched with good performance. A limited number of residues surrounding each explicit chemistry region are included in the index so that the context of these regions can also be searched. By using explicit chemistry to represent modified regions, we can search across classes of modifications for common features. For example, a single substructure search query will find green fluorescent protein and its histidine, phenylalanine, and tryptophan analogs.
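One way to picture the choice of what to index is to take each modified residue plus a fixed number of flanking residues. The sketch below is an assumption about how such windows might be computed; the abstract does not give the window size, and the function name `index_windows`, the `context` parameter, and the merging of overlapping or adjacent windows are illustrative choices.

```python
from typing import List, Tuple


def index_windows(modified: List[bool], context: int) -> List[Tuple[int, int]]:
    """Return inclusive (start, end) residue ranges to expand and index.

    Each modified residue contributes itself plus `context` neighbours on
    either side; overlapping or adjacent ranges are merged into one.
    """
    windows: List[Tuple[int, int]] = []
    last = len(modified) - 1
    for i, is_modified in enumerate(modified):
        if not is_modified:
            continue
        start, end = max(0, i - context), min(last, i + context)
        if windows and start <= windows[-1][1] + 1:
            # Extend the previous window instead of starting a new one.
            windows[-1] = (windows[-1][0], max(windows[-1][1], end))
        else:
            windows.append((start, end))
    return windows


# A 12-residue chain with modifications at positions 3, 4, and 10.
flags = [i in (3, 4, 10) for i in range(12)]
regions = index_windows(flags, context=1)  # [(2, 5), (9, 11)]
```

Only the residues inside these ranges would need to be expanded to explicit atoms for the substructure index; the rest of the chain can stay in template form.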

Templates are stored with the structure, providing a self-contained file format. The use of NEMA keys allows templates from different structures to be compared, and allows storage of structures containing a canonical list of templates. The residues have defined attachment points, allowing automated traversal of a protein backbone or location of non-backbone bonds to residues.
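The roles of template keys and attachment points can be illustrated with a short sketch. Everything here is hypothetical: `template_key` uses a plain SHA-256 hash of an already-canonical structure string as a stand-in and is not the NEMA algorithm, and the attachment point labels `"N"`, `"C"`, and `"S"` are invented for the example.

```python
import hashlib
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


def template_key(canonical_structure: str) -> str:
    """Stand-in structure key: a hash of an already-canonical structure string.

    This is NOT the NEMA algorithm; it only plays the same role in the sketch.
    """
    return hashlib.sha256(canonical_structure.encode("utf-8")).hexdigest()[:16]


@dataclass
class LinkedResidue:
    template: str
    # attachment point label -> (neighbour residue index, neighbour's label)
    links: Dict[str, Tuple[int, str]] = field(default_factory=dict)


def connect(residues: List[LinkedResidue], i: int, pi: str, j: int, pj: str) -> None:
    """Record a bond between attachment point pi of residue i and pj of residue j."""
    residues[i].links[pi] = (j, pj)
    residues[j].links[pj] = (i, pi)


def walk_backbone(residues: List[LinkedResidue], start: int = 0) -> List[int]:
    """Follow the 'C' attachment point from residue to residue."""
    order, seen, current = [], set(), start
    while current is not None and current not in seen:
        seen.add(current)
        order.append(current)
        nxt = residues[current].links.get("C")
        current = nxt[0] if nxt else None
    return order


def non_backbone_bonds(residues: List[LinkedResidue]):
    """Bonds that use any attachment point other than the backbone 'N' and 'C'."""
    found = set()
    for i, residue in enumerate(residues):
        for point, (j, other) in residue.links.items():
            if point not in ("N", "C"):
                found.add(tuple(sorted([(i, point), (j, other)])))
    return sorted(found)


# Templates from two files collapse to one list when their keys match.
file_a = ["NCC(=O)O", "CC(N)C(=O)O"]  # glycine, alanine (assumed canonical strings)
file_b = ["NCC(=O)O"]
merged = {template_key(s): s for s in file_a + file_b}

# Cys-Gly-Ala-Cys with a side-chain bond between the two cysteines.
peptide = [LinkedResidue(t) for t in ("Cys", "Gly", "Ala", "Cys")]
for k in range(3):
    connect(peptide, k, "C", k + 1, "N")
connect(peptide, 0, "S", 3, "S")

backbone = walk_backbone(peptide)        # [0, 1, 2, 3]
crosslinks = non_backbone_bonds(peptide)  # [((0, 'S'), (3, 'S'))]
```

Because every bond between residues goes through a named attachment point, backbone traversal and cross-link discovery reduce to simple lookups rather than atom-level graph searches.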

We will present example structures and structural queries highlighting capabilities of our representation.

Author information

Authors and Affiliations

Symyx Technologies, 2440 Camino Ramon, San Ramon, California, USA
Joeseph L Durant, WL Chen, BD Christie, DL Grier, BA Leland & JG Nourse

Rights and permissions

Open Access: This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

About this article

Cite this article

Durant, J.L., Chen, W., Christie, B. et al. Representation and searching of biomolecules. J Cheminform 2 (Suppl 1), O4 (2010). https://doi.org/10.1186/1758-2946-2-S1-O4

