- Poster presentation
- Open Access
Consistency of sugar structures and their annotation in the PDB
Journal of Cheminformaticsvolume 6, Article number: P41 (2014)
Cell-cell recognition is the first stage in many important phenomena such as infection by bacteria and viruses, communication among cells of lower eukaryotes, binding of sperm to egg, etc. . Cell-cell recognition relies on sugar (carbohydrate) specific interactions at the cell surface. Theoretical studies typically involve molecular modeling of sugars and sugar-specific protein receptors. These studies rely on structural information obtained mainly by crystallography and nuclear magnetic resonance, and deposited in the Protein Databank (PDB). Since the main purpose of PDB is to store the structure of proteins and nucleic acids, thus, it is expected that PDB structure files are complete and correctly annotated.
Nonetheless, sugars exhibit a structural diversity larger than amino acids or nucleotides, a property which makes them ideal for recognition. At the same time, sugars are characterized by specific and very sensitive structural features such as multiple chiral centers on each ring. Because of these peculiarities, the validation and annotation of sugar structures is not straightforward.
Our first goal was to develop a methodology that can identify whether a sugar structure is complete and correctly annotated. Our second goal was then to check all PDB entries containing sugars, and record whatever problems we encounter in the sugar structures. For this purpose we collected all sugar structures which appear as ligands in PDB entries, and compared them to model structures available in Ligand Expo , a curated repository of ligand chemical and structural information. In order to perform the comparison we used several tools for structural comparison currently available (SiteBinder , Open Babel ), as well as two in-house programs. We report here on our findings regarding the complete and correctly annotated sugar structures in PDB, together with the problematic cases.
Brandley BK, Schnaar RL: Cell-surface carbohydrates in cell recognition and response. 1986, 40 (1): 97-111.
Feng Z, Chen L, Maddula H, Akcan O, Oughtred R, Berman HM, Westbrook J: Ligand Depot: a data warehouse for ligands bound to macromolecules. Bioinformatics. 2004, 20 (13): 2153-2155. 10.1093/bioinformatics/bth214.
Sehnal D, Vařeková RS, Huber HJ, Geidl S, Ionescu CM, Wimmerová M, Koča J: SiteBinder: an improved approach for comparing multiple protein structural motifs. J Chem Inf Model. 2012, 52 (2): 343-359. 10.1021/ci200444d.
O'Boyle NM, Banck M, James CA, Morley C, Vandermeersch T, Hutchison GR: Open Babel: An open chemical toolbox. Journal of Cheminformatics. 2011, 3: 33-10.1186/1758-2946-3-33.