Physicochemical Property Labels as Molecular Descriptors for Improved Analysis of Compound–Protein and Compound–Compound Networks

  • Masaaki Kotera
Part of the Methods in Molecular Biology book series (MIMB, volume 1825)


Small molecules can be represented in various file formats: (1) one-line systems such as SMILES (Simplified Molecular Input Line Entry System) and InChI (International Chemical Identifier), and (2) table systems such as molfiles, SDF (Structure Data File), and KCF (KEGG Chemical Function). KCF and KCF-S (KEGG Chemical Function-and-Substructures) attach physicochemical property labels to the representations of small molecules, and contribute to improved analysis of compound–protein networks, including drug–target interactions, and compound–compound networks, including metabolic pathways. This chapter explains the main concepts, usage, and some example applications of the KCFCO and KCF-S packages.
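To make the distinction between one-line and table systems concrete, the following minimal sketch writes ethanol once as a SMILES string and once as a hand-written V2000 molfile, then reads the atom and bond tables back with the Python standard library. The helper name `parse_molfile`, the constant names, and the 2D coordinates are illustrative assumptions and are not taken from the chapter or from the KCFCO and KCF-S packages. KCF itself is not shown here.

```python
# One-line notation: the whole molecule is a single string.
ETHANOL_SMILES = "CCO"

# Table notation: a hand-written V2000 molfile (coordinates are arbitrary).
ETHANOL_MOLFILE = "\n".join([
    "ethanol",                # header line 1: molecule name
    "  hand-written sketch",  # header line 2: program info (free text here)
    "",                       # header line 3: comment
    "  3  2  0  0  0  0  0  0  0  0999 V2000",  # counts line: 3 atoms, 2 bonds
    "    0.0000    0.0000    0.0000 C   0  0  0  0  0  0  0  0  0  0  0  0",
    "    1.5000    0.0000    0.0000 C   0  0  0  0  0  0  0  0  0  0  0  0",
    "    2.2500    1.2990    0.0000 O   0  0  0  0  0  0  0  0  0  0  0  0",
    "  1  2  1  0",           # bond: atom 1 - atom 2, single
    "  2  3  1  0",           # bond: atom 2 - atom 3, single
    "M  END",
])


def parse_molfile(text):
    """Return (atom symbols, bonds) from a simple V2000 molfile string.

    Hypothetical helper for illustration only: it ignores charges,
    stereochemistry, and property lines.
    """
    lines = text.split("\n")
    counts = lines[3]              # fourth line holds the counts
    n_atoms = int(counts[0:3])     # columns 1-3: number of atoms
    n_bonds = int(counts[3:6])     # columns 4-6: number of bonds

    # Atom block: x, y, z, then the element symbol as the fourth field.
    atoms = [lines[4 + i].split()[3] for i in range(n_atoms)]

    # Bond block: three-character fields for atom 1, atom 2, bond order.
    bonds = []
    for i in range(n_bonds):
        row = lines[4 + n_atoms + i]
        bonds.append((int(row[0:3]), int(row[3:6]), int(row[6:9])))
    return atoms, bonds


atoms, bonds = parse_molfile(ETHANOL_MOLFILE)
print(ETHANOL_SMILES)  # CCO
print(atoms)           # ['C', 'C', 'O']
print(bonds)           # [(1, 2, 1), (2, 3, 1)]
```

The table form lists each atom and bond on its own row, which is what gives a format room to carry a per-atom label; the one-line form encodes the same connectivity implicitly in the order of the characters.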

Key words

Molecular file formats; Chemical fingerprints; Chemical descriptors; Compound–protein network; Drug–target interaction; Compound–compound network; Metabolic pathway



This work was funded by the Ministry of Education, Culture, Sports, Science and Technology of Japan, the Japan Science and Technology Agency, and the Japan Society for the Promotion of Science; JSPS Kakenhi (25108714,). This work was also supported by the Program to Disseminate Tenure Tracking System, MEXT, Japan.



Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. Department of Chemical System Engineering, School of Engineering, The University of Tokyo, Tokyo, Japan
