Towards automated metabolome assembly: application of text mining to correlate small molecules, targets and tissues

Moreno, P; Jayaseelan, KV; Steinbeck, C

doi:10.1186/1758-2946-3-S1-P19

Volume 3 Supplement 1

6th German Conference on Chemoinformatics, GCC 2010

Poster presentation
Open access
Published: 19 April 2011

Towards automated metabolome assembly: application of text mining to correlate small molecules, targets and tissues

P Moreno¹,
KV Jayaseelan¹ &
C Steinbeck¹

Journal of Cheminformatics volume 3, Article number: P19 (2011) Cite this article

1644 Accesses
1 Citations
Metrics details

How many species are there on Earth? Single celled organisms aside a few million are known. How many species had their genomes sequenced so far? Around two thousand. Approaching the era of a new genome sequence per week, it is fair to wonder: How many metabolomes have been compiled? The metabolome could be considered the ultimate phenotypic expression of the cell, and yet, we barely have one [1]. The metabolome refers to the complete set of small molecules (< 1500Da) present on a biological sample or organism [2]. Among all omics, metabolomics is probably the most unique to each individual in terms of the variation of its elements. Here we define Automated Metabolome Assembly (AMA) as a set of techniques to predict the metabolome of an organism based on the complete set of boundary information available (gene sequence, proteomics, bibliomics, etc.). As a first step towards Automated Metabolome Assembly we report the implementation of a text mining resource based on the existing EBI text mining infrastructure [3] to address the problem of finding co-occurrences of chemical entities, proteins, organisms, and tissues/cell types terms. We created a workflow based on a database holding 365 million of relations between these terms (proteins, metabolites, organisms and tissues) and PubMed citations, obtained from the whole PubMed collection up to September 2009. Dictionaries of terms for each kind of entity were generated from different ontologies. All known metabolites present in liver were obtained from the latest version of HMDB. The text mining results were compared to this reference set. Close to 90% of the reference set shows co-occurence of our liver-related tissue ontology terms with the respective metabolite names, demonstrating that this text-mining workflow can form an important building block for a comprehensive system for metabolome prediction.

References

Wishart D: Hmdb: the human metabolome database. Nucleic Acids Res. 2007, 35 (Database issue): D521-10.1093/nar/gkl923.
Article CAS Google Scholar
Wishart D: Current progress in computational metabolomics. Briefings in Bioinformatics. 2007, 8: 279-10.1093/bib/bbm030.
Article CAS Google Scholar
Rebholz-Schuhmann D, Arregui M, Gaudan S, Kirsch H, Jimeno A: Text processing through web services: calling whatizit. Bioinformatics. 2008, 24: 296-10.1093/bioinformatics/btm557.
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Cheminformatics and Metabolism Group, European Bioinformatics Institute (EBI), Cambridge, CB10 1SD, UK
P Moreno, KV Jayaseelan & C Steinbeck

Authors

P Moreno
View author publications
You can also search for this author in PubMed Google Scholar
KV Jayaseelan
View author publications
You can also search for this author in PubMed Google Scholar
C Steinbeck
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P Moreno.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Moreno, P., Jayaseelan, K. & Steinbeck, C. Towards automated metabolome assembly: application of text mining to correlate small molecules, targets and tissues. J Cheminform 3 (Suppl 1), P19 (2011). https://doi.org/10.1186/1758-2946-3-S1-P19

Download citation

Published: 19 April 2011
DOI: https://doi.org/10.1186/1758-2946-3-S1-P19

6th German Conference on Chemoinformatics, GCC 2010

Towards automated metabolome assembly: application of text mining to correlate small molecules, targets and tissues

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Journal of Cheminformatics

Contact us

6th German Conference on Chemoinformatics, GCC 2010

Towards automated metabolome assembly: application of text mining to correlate small molecules, targets and tissues

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Journal of Cheminformatics

Contact us