- Oral presentation
- Open Access
Semantics vs. statistics in chemical markup
Journal of Cheminformatics volume 4, Article number: O16 (2012)
Since the late 1990s, natural language processing (NLP) has seen a massive shift from high-precision, low-recall systems based on small sets of hand-written rules, to methods based on the statistical analysis of large corpora. The field of chemoinformatics, likewise, is dominated by statistical and machine-learning approaches. In recent years, however, pharmaceutical companies have been engaging more and more with Semantic Web technologies, which are largely built around the sorts of hand-written systems that NLP has moved away from this century. We discuss where our current text analysis and Semantic Web efforts at the Royal Society of Chemistry are headed and how we're making use of the unreasonable effectiveness of data.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
About this article
Cite this article
Batchelor, C. Semantics vs. statistics in chemical markup. J Cheminform 4 (Suppl 1), O16 (2012). https://doi.org/10.1186/1758-2946-4-S1-O16
- Natural Language
- Pharmaceutical Company
- Royal Society
- Text Analysis
- Language Processing