Abstract
In this chapter, we discuss how the profusion of experimental chemogenomics data available in public repositories is transforming the field of cheminformatics. In particular, we describe (i) both theoretical and technical challenges related to the management, analysis, and visualization of large and diverse chemical datasets, (ii) the unique opportunities offered by Big Chemical Data for designing molecules with the desired properties and expanding the use of cheminformatics in novel areas of research, and (iii) some innovative approaches that are likely to shape the future of cheminformatics.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Chute CG, Ullman-Cullere M, Wood GM, Lin SM, He M, Pathak J (2013) Some experiences and opportunities for big data in translational research. Genet Med 15:802–809
Moore KD, Eyestone K, Coddington DC (2013) The big deal about big data. Healthc Financ Manage 67:60–68
Tropsha A (2010) Best practices for QSAR model development, validation, and exploitation. Mol Inform 29:476–488
Varnek A, Fourches D, Sieffert N, Solov’ev VP, Hill C, Lecomte M (2007) QSPR Modeling of the Am III/Eu III separation factor: how far can we predict? J Solv Extr Ion Exch 25:1–26
Varnek A, Fourches D, Solov’ev V, Klimchuk O, Ouadi A, Billard I (2007) Successful “In Silico” design of new efficient uranyl binders. J Solv Extr Ion Exch 25:433–462
Carnero A (2006) High throughput screening in drug discovery. Clin Transl Oncol 8:482–490
Kozikowski AP, Roth B, Tropsha A (2006) Why academic drug discovery makes sense. Science 313:1235–1236
Bajorath J (2013) A perspective on computational chemogenomics. Mol Inform 32:1025–1028
Keiser MJ, Setola V, Irwin JJ et al (2009) Predicting new molecular targets for known drugs. Nature 462:175–181
Xie X-Q (2010) Exploiting PubChem for virtual screening. Expert Opin Drug Discov 5:1205–1220
Williams AJ (2014) Introduction to Chemspider. http://www.chemspider.com/help-what-can-i-do-with-chemspider.aspx. Accessed 8 Nov 2014.
Irwin JJ, Shoichet BK (2005) ZINC—a free database of commercially available compounds for virtual screening. J Chem Inf Model 45:177–182
Sipes NS, Martin MT, Kothiya P, Reif DM, Judson RS, Richard AM, Houck KA, Dix DJ, Kavlock RJ, Knudsen TB (2013) Profiling 976 ToxCast chemicals across 331 enzymatic and receptor signaling assays. Chem Res Toxicol 26:878–895
Huang R, Sakamuru S, Martin MT et al (2014) Profiling of the Tox21 10K compound library for agonists and antagonists of the estrogen receptor alpha signaling pathway. Sci Rep 4:5664–5673
Tice RR, Austin CP, Kavlock RJ, Bucher JR (2013) Improving the human hazard characterization of chemicals: a Tox21 update. Environ Health Perspect 121:756–765
Knudsen TB, Martin MT, Kavlock RJ, Judson RS, Dix DJ, Singh AV (2009) Profiling the activity of environmental chemicals in prenatal developmental toxicity studies using the U.S. EPA’s ToxRefDB. Reprod Toxicol 28:209–219
Kuhn M, Szklarczyk D, Franceschini A, von Mering C, Jensen LJ, Bork P (2012) STITCH 3: zooming in on protein-chemical interactions. Nucleic Acids Res 40:D876–D880
Kanehisa M, Goto S, Sato Y, Furumichi M, Tanabe M (2012) KEGG for integration and interpretation of large-scale molecular data sets. Nucleic Acids Res 40:D109–D114
Polishchuk PG, Madzhidov TI, Varnek A (2013) Estimation of the size of drug-like chemical space based on GDB-17 data. J Comput Aided Mol Des 27:675–679
Ruddigkeit L, Blum LC, Reymond J-L (2013) Visualization and virtual screening of the chemical universe database GDB-17. J Chem Inf Model 53:56–65
Jorgensen WL (2006) QSAR/QSPR and proprietary data. J Chem Inf Model 46:937–937
Young D, Martin D, Venkatapathy R, Harten P (2008) Are the chemical structures in your QSAR correct? QSAR Comb Sci 27:1337–1345
Ekins S, Olechno J, Williams AJ (2013) Dispensing processes impact apparent biological activity as determined by computational and statistical analyses. PLoS One 8:e62325
Prinz F, Schlange T, Asadullah K (2011) Believe it or not: how much can we rely on published data on potential drug targets? Nat Rev Drug Discov 10:712
Fourches D, Muratov E, Tropsha A (2010) Trust, but verify: on the importance of chemical structure curation in cheminformatics and QSAR modeling research. J Chem Inf Model 50:1189–1204
Baker NC, Hemminger BM (2010) Mining connections between chemicals, proteins, and diseases extracted from Medline annotations. J Biomed Inform 43:510–519
Fourches D, Barnes JC, Day NC, Bradley P, Reed JZ, Tropsha A (2010) Cheminformatics analysis of assertions mined from literature that describe drug-induced liver injury in different species. Chem Res Toxicol 23:171–183
Low Y, Uehara T, Minowa Y et al (2011) Predicting drug-induced hepatotoxicity using QSAR and toxicogenomics approaches. Chem Res Toxicol 24:1251–1262
Low Y, Sedykh AY, Fourches D, Golbraikh A, Whelan M, Rusyn I, Tropsha A (2013) Integrative chemical-biological read-across approach for chemical hazard classification. Chem Res Toxicol 26:1199–1208
Donaldson K, Poland CA (2009) Nanotoxicology: new insights into nanotubes. Nat Nanotechnol 4:708–710
Balbus JM, Florini K, Denison RA, Walsh SA (2006) Getting it right the first time: developing nanotechnology while protecting workers, public health, and the environment. Ann N Y Acad Sci 1076:331–342
Hart P (2009) Nanotechnology, synthetic biology, & public opinion. The Woodrow Wilson International Center For Scholars
Bystrzejewska-Piotrowska G, Golimowski J, Urban PL (2009) Nanoparticles: their potential toxicity, waste and environmental management. Waste Manag 29:2587–2595
Jones R (2009) Nanotechnology, energy and markets. Nat Nanotechnol 4:75
Lockman PR, Mumper RJ, Khan MA, Allen DD (2002) Nanoparticle technology for drug delivery across the blood-brain barrier. Drug Dev Ind Pharm 28:1–13
Linkov I, Satterstrom FK, Corey LM (2008) Nanotoxicology and nanomedicine: making hard decisions. Nanomedicine 4:167–171
Fourches D, Pu D, Tassa C, Weissleder R, Shaw SY, Mumper RJ, Tropsha A (2010) Quantitative nanostructure-activity relationship modeling. ACS Nano 4:5703–5712
Fourches D, Pu D, Tropsha A (2011) Exploring quantitative nanostructure-activity relationships (QNAR) modeling as a tool for predicting biological effects of manufactured nanoparticles. Comb Chem High Throughput Screen 14:217–225
Heinzerling L, Klein R, Rarey M (2012) Fast force field-based optimization of protein-ligand complexes with graphics processor. J Comput Chem 33:2554–2565
Fourches D, Tropsha A (2013) Using graph indices for the analysis and comparison of chemical datasets. Mol Inform 32:827–842
Rognan D (2013) Towards the next generation of computational chemogenomics tools. Mol Inform 32:1029–1034
Acknowledgments
The author sincerely thanks Profs. Alexandre Varnek (University of Strasbourg, France) and Alexander Tropsha (University of North Carolina at Chapel Hill, USA) for fruitful discussions, training, support and trust. This chapter has been proofread by Dr. Laura Widman (University of North Carolina at Chapel Hill, USA). Financial support from NSF ABI 1147145, EPA RD832720, SRC/Sematech, and UNC Junior Faculty Award is also gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Fourches, D. (2014). Cheminformatics: At the Crossroad of Eras. In: Gorb, L., Kuz'min, V., Muratov, E. (eds) Application of Computational Techniques in Pharmacy and Medicine. Challenges and Advances in Computational Chemistry and Physics, vol 17. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-9257-8_16
Download citation
DOI: https://doi.org/10.1007/978-94-017-9257-8_16
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-017-9256-1
Online ISBN: 978-94-017-9257-8
eBook Packages: Chemistry and Materials ScienceChemistry and Material Science (R0)