Fingerprint-based detection of acute aquatic toxicity
Keywords: Environmental Protection Agency; Protection Agency; Classification Model; Toxic Compound; Molecular Descriptor
In this work we show the effectiveness of 2D structural fingerprints in predicting the aquatic toxicity of chemical compounds, creating a self-contained system for structure-based aquatic toxicity classification. Using data from the U.S. Environmental Protection Agency Fathead Minnow (EPA-FHM) dataset [1], we build a non-linear RBF SVM [2] classifier that distinguishes acutely toxic compounds from less toxic compounds, loosely following the criterion stipulated by the E.U. REACH legislation [3]. The classifier achieves up to 86% accuracy in leave-one-out validation using 580 of the dataset's 614 compounds. This performance is comparable with that of models built from the same dataset using more sophisticated molecular descriptors, such as AutoMEP and Sterimol descriptors [4]. We apply our classification model to predict the aquatic toxicity of 3 million compounds in the MMsINC database [5]. Furthermore, we create a linear SVM model using the same technique and apply it to the MMsINC data, additionally integrating the EXPLAIN system [6], which allows us to show which structural features are responsible for the model classifying a molecule as less toxic or acutely toxic.
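The idea of attributing a linear model's decision to individual structural features can be illustrated with a minimal sketch. It does not reproduce the EXPLAIN system or the trained SVM: the feature names, weights, and bias below are hypothetical values chosen for illustration, and the only property relied on is that a linear classifier's score is an additive sum over the fingerprint bits that are set.

```python
from typing import Dict, List, Tuple

# Hypothetical linear model: one weight per fingerprint bit, plus a bias.
# Positive weights push toward "acutely toxic", negative toward "less toxic".
# Feature names and numbers are invented for illustration only.
WEIGHTS: Dict[str, float] = {
    "aromatic_ring": 0.40,
    "nitro_group": 0.90,
    "hydroxyl": -0.60,
    "carboxylic_acid": -0.80,
    "long_alkyl_chain": 0.70,
}
BIAS = -0.50


def explain(fingerprint: Dict[str, int]) -> Tuple[str, float, List[Tuple[str, float]]]:
    """Return (label, score, contributions) for a binary fingerprint.

    For a linear model, score = bias + sum(weight_i * bit_i), so each set
    bit contributes exactly its weight and unset bits contribute nothing.
    """
    contributions = [
        (name, weight)
        for name, weight in WEIGHTS.items()
        if fingerprint.get(name, 0)
    ]
    score = BIAS + sum(value for _, value in contributions)
    label = "acutely toxic" if score > 0 else "less toxic"
    # Rank features by the size of their influence, largest first.
    contributions.sort(key=lambda item: abs(item[1]), reverse=True)
    return label, score, contributions


if __name__ == "__main__":
    # Hypothetical molecule with three of the five bits set.
    molecule = {"aromatic_ring": 1, "nitro_group": 1, "hydroxyl": 1}
    label, score, contributions = explain(molecule)
    print(f"{label} (score {score:+.2f})")
    for name, value in contributions:
        print(f"  {name}: {value:+.2f}")
```

Because the score decomposes bit by bit, the ranked list shows directly which substructures pushed the molecule toward either class. A non-linear RBF kernel does not admit this simple per-feature decomposition, which is one reason to pair the explanation step with a linear model.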
- 2. Boser B, Guyon I, Vapnik V: A Training Algorithm for Optimal Margin Classifiers. Computational Learning Theory. 1992, 144-152.
- 3. EU: Corrigendum to Regulation (EC) No 1907/2006 of the European Parliament and of the Council of 18 December 2006 concerning the Registration, Evaluation, Authorization and Restriction of Chemicals (REACH). Off J Eur Union L136. 2007, 50-
- 4. Michielan L, Pireddu L, Floris M, Bacilieri M, Rodriguez-Tomé P, Moro S: 2009.
- 5. Masciocchi B, Frau G, Fanton M, Sturlese M, Floris M, Pireddu L, Palla P, Cedrati F, Rodriguez-Tomé P, Moro S: MMsINC: a large-scale chemoinformatics database. Nucleic Acids Research. 2008, 37 (Database): D284-90.
- 6. Poulin B, Eisner R, Szafron D, Lu P, Greiner R, Wishart D, Fyshe A, Pearcy B, Macdonell C, Anvik J: Visual Explanation of Evidence in Additive Classifiers. Innovative Applications of Artificial Intelligence. 2006.
This article is published under license to BioMed Central Ltd.