Abstract
The concept of chemical similarity has many applications in several fields of cheminformatics. One common use of chemical similarity measurements, based on the principle that similar molecules have similar properties, is in the context of the read-across approach, where estimates of a specific endpoint for a chemical are obtained starting from experimental data available from highly similar compounds.
This chapter reports an implementation of chemical similarity and the analysis of multiple combinations of binary fingerprints and similarity metrics in the context of the read-across technique.
This analysis demonstrates that the classical similarity measurements can be improved with a generalizable model of similarity. The approach presented here has been implemented in two open-source software tools for computational toxicology (CAESAR and VEGA).
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Madan AK, Bajaj S, Dureja H (2013) Classification models for safe drug molecules. In: Reisfeld B, Mayeno AN (eds) Computational toxicology, vol 930. Humana Press, New York, pp 99–124
Read-Across Assessment Framework (RAAF), accessed Sept 2017
http://www.talete.mi.it/, accessed Sept 2017
https://www.mn-am.com/products/adrianacode
https://www.ncbi.nlm.nih.gov/pubmed/29086040,cdk.sf.net
Cereto-Massagué A, Ojeda MJ, Valls C, Mulero M, Garcia-Vallvé S, Pujadas G (2015) Molecular fingerprint similarity search in virtual screening. Methods 71:58–63
Daylight Chemical Information Systems Inc., http://www.daylight.com/
Tripos Inc., http://www.tripos.com
Steinbeck C, Han Y, Kuhn S, Horlacher O, Luttmann E, Willighagen E (2003) The chemistry development kit (CDK): an open-source java library for chemo- and bioinformatics. J Chem Inf Comput Sci 43:493–500
Steinbeck C, Hoppe C, Kuhn S, Floris M, Guha R, Willighagen EL (2006) Recent developments of the chemistry development kit (CDK)–an open-source java library for chemo- and bioinformatics. Curr Pharm Des 12(17):2111–2120
Todeschini R, Consonni V, Xiang H, Holliday J, Buscema M, Willett P (2012) Similarity coefficients for binary chemoinformatics data: overview and extended comparison using simulated and real data sets. J Chem Inf Model 52:2884–2901
Campillos M, Kuhn M, Gavin A-C, Jensen LJ, Bork P (2008) Drug target identification using side-effect similarity. Science 321:263–266
Nickel J, Gohlke B-O, Erehman J, Banerjee P, Rong WW, Goede A, Dunkel M, Preissner R (2014) SuperPred: update on drug classification and target prediction. Nucleic Acids Res 42:W26–W31
Lounkine E, Keiser MJ, Whitebread S, Mikhailov S, Hamon J, Jenkins JL, Lavan P, Weber E, Doak AK, Côté S, Shoichet BK, Urban L (2012) Large-scale prediction and testing of drug activity on side-effect targets. Nature 486:361–367
Drwal MN, Banerjee P, Dunkel M, Wettig MR, Preissner R (2014) ProTox: a web server for the in silico prediction of rodent oral toxicity. Nucleic Acids Res 42:W53–W58
Manganaro A, Pizzo F, Lombardo A, Pogliaghi A, Benfenati E (2016) Predicting persistence in the sediment compartment with a new automatic software based on the k-nearest neighbor (k-NN) algorithm. Chemosphere 144:1624–1630
Drwal MN, Siramshetty VB, Banerjee P, Goede A, Preissner R, Dunkel M (2015) Molecular similarity-based predictions of the Tox21 screening outcome. Front Environ Sci 3:1–9
Floris M, Manganaro A, Nicolotti O, Medda R, Mangiatordi GF, Benfenati E (2014) A generalizable definition of chemical similarity for read-across. J Chem 6:39
VEGA project website: http://www.vega-qsar.eu/
Hall LH, Kier LB (1995) Electrotopological state indices for atom types: a novel combination of electronic, topological, and valence state information. J Chem Inf Comput Sci 35:1039–1045
Klekota J, Roth FP (2008) Chemical substructures that enrich for biological activity. Bioinformatics 24:2518–2525
MACCS Structural Keys. CA (USA): Symyx Software S. R
National Center for Biotechnology Information: “PubChem Substructure Fingerprint v1.3.” PubChem Data Specification 2009 (ftp://ftp.ncbi.nlm.nih.gov/pubchem/specifications/pubchem_fingerprints.txt)
Org.openscience.cdk.fingerprint: SubstructureFingerprinter. http://pele.farmbio.uu.se/nightly/api/org/openscience/cdk/fingerprint/SubstructureFingerprinter.html
Al Khalifa A, Haranczyk M, Holliday J (2009) Comparison of nonbinary similarity coefficients for similarity searching, clustering and compound selection. J Chem Inf Model 49:1193–1201
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this protocol
Cite this protocol
Floris, M., Olla, S. (2018). Molecular Similarity in Computational Toxicology. In: Nicolotti, O. (eds) Computational Toxicology. Methods in Molecular Biology, vol 1800. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7899-1_7
Download citation
DOI: https://doi.org/10.1007/978-1-4939-7899-1_7
Published:
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7898-4
Online ISBN: 978-1-4939-7899-1
eBook Packages: Springer Protocols