- Poster presentation
- Open Access
Fingerprint-based detection of acute aquatic toxicity
Journal of Cheminformaticsvolume 2, Article number: P46 (2010)
In this work we show the effectiveness of 2D structural fingerprints in the prediction of aquatic toxicity of chemical compounds, creating a self-contained system for structure-based aquatic toxicity classification. Using the data from the U.S. Environmental Protection Agency Fat Head Minnow (EPA-FHM) dataset  we build a non-linear RBF SVM  classifier that distinguishes acutely toxic compounds from less toxic compounds, loosely according to the criterion stipulated by the E.U. Reach legislation . The classifier achieves up to 86% accuracy in leave-one-out validation using 580 of the dataset's 614 compounds. This performance is comparable with models built from the same dataset using more sophisticated molecular descriptors, such as AutoMEP and Sterimol descriptors . We apply our classification model to predict the aquatic toxicity of 3M compounds in the MMsINC database . Furthermore, we create a linear SVM model using the same technique and apply it to the MMsINC data, with the additional integration of the EXPLAIN system  which allows us to show which structural features are responsible for the model classifying a molecule as less toxic or acutely toxic.
Russom CL, Bradbury SP, Broderius SJ, Hammermeister DE, Drummond RA: Predicting modes of action from chemical structure: Acute toxicity in the fathead minnow (Pimephales promelas). Environmental Toxicology and Chemistry. 1997, 16 (5): 948-967.
Boser B, Guyon I, Vapnik V: A Training Algorithm for Optimal Margin Classifiers. Computational Learning Theory. 1992, 144-152.
EU: Corrigendum to Regulation (EC) No 1907/2006 of the European Parliament and of the Council of 18 December 2006 concerning the Registration, Evaluation, Authorization and Restriction of Chemicals (REACH). Off J Eur Union L136. 2007, 50-
Michielan L, Pireddu L, Floris M, Bacilieri M, Rodriguez-Tomé P, Moro S: 2009.
Masciocchi B, Frau G, Fanton M, Sturlese M, Floris M, Pireddu L, Palla P, Cedrati F, RodriguezTomé P, Moro S: MMsINC: a large-scale chemoinformatics database. Nucleic Acids Research. 2008, D284-90. 37 Database
Poulin B, Eisner R, Szafron D, Lu P, Greiner R, Wishart D, Fyshe A, Pearcy B, Macdonell C, Anvik J: Visual Explanation of Evidence in Additive Classifiers. Innovative Applications of Artificial Intelligence. 2006