Skip to main content

The Use of Similarity and Clustering Techniques for the Prediction of Molecular Properties

  • Chapter
Applied Multivariate Analysis in SAR and Environmental Studies

Part of the book series: Eurocourses: Chemical and Environmental Science ((EUCE,volume 2))

Abstract

The fine chemicals industry makes extensive use of systems for the storage and manipulation of chemical structure information. The primary function of these systems is to provide facilities for storage and retrieval, but the close relationship that is known to exist between the structure of a molecule and its physical, chemical and biological properties has led to increasing interest in the use of chemical structure databases for the prediction of molecular properties.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • Adamson, G.W. and Bawden, D. (1981). J. Chem. Inform. Comput. Sci. 21, 204.

    Article  CAS  Google Scholar 

  • Adamson, G.W. and Bush, J.A. (1973). Information Storage and Retrieval 9, 561.

    Article  CAS  Google Scholar 

  • Adamson, G.W. and Bush, J.A. (1975). J. Chem. Inform. Comput. Sci. 15, 55.

    Article  CAS  Google Scholar 

  • Ash, J.E., Chubb, P.A., Ward, S.E., Welford, S.M. and Willett, P. (1985). Communication, Storage and Retrieval of Chemical Information, Ellis Horwood, Chichester.

    Google Scholar 

  • Barnard, J.M. (1989). Perspect. Inform. Manag. 1, 133.

    Google Scholar 

  • Basak, S.C., Magnuson, V.R., Niemi, G.J. and Regal, R.R. (1988) Discrete Appl. Math. 19, 17.

    Article  Google Scholar 

  • Bawden, D. (1983). J. Chem. Inform. Comput. Sci. 23, 14.

    Article  CAS  Google Scholar 

  • Bawden, D. (1990). Applications of two-dimensional chemical similarity measures to database analysis and querying. In, Johnson, M.A. and Maggiora, G.M. (Eds.) Concepts and Applications of Molecular Similarity, Wiley, New York, pp. 65–76.

    Google Scholar 

  • Bawden, D., Catlow, J.T., Devon, T.K., Dalton, J.M., Lynch, M.F. and Willett, P. (1981). J. Chem. Inform. Comput. Sci. 21, 83.

    Article  CAS  Google Scholar 

  • Broto, P. Moreau, G. and Vandycke, C. (1984). Eur. J. Med. Chem. 19, 66.

    CAS  Google Scholar 

  • Carhart, R.E., Smith, D.H. and Venkataraghavan, R. (1985). J. Chem. Inform. Comput. Sci. 25, 64.

    Article  CAS  Google Scholar 

  • Cramer, R.D., Redl, G. and Berkoff, C.E. 91973). J. Med. Chem. 17, 533.

    Google Scholar 

  • Downs, G.M., Gillet V.J., Holliday J.D. and Lynch M.F. (1988). J. Chem. Inform. Comput. Sci. 29, 215.

    Google Scholar 

  • Downs, G.M., Poirrette, A.R., Willett, P. and Walsh, P.T. (1991). Evaluation of similarity searching methods using activity and toxicity data. Proceedings of the Second International Conference on Chemical Structures (Noordwijkerhout, Holland, June 1990). In press.

    Google Scholar 

  • Downs, G.M., Walsh, P.T. and Booth, A.M. (1990). Similarity and clustering of chemical structures for property prediction. Paper presented at the Second International Workshop on Computer Chemistry (Merseburg, Germany, October 1990 ), Health and Safety Executive Section Report, Project R41.35RL.

    Google Scholar 

  • Enslein, K. (1988). Toxicol. Indust. Health 4, 479.

    CAS  Google Scholar 

  • Enslein, K., Borgstedt, H.H., Blake, B.W. and Hart, J.B. (1987). In Vitro Toxicol. 1, 129.

    CAS  Google Scholar 

  • Figueras, J. (1972). J. Chem. Docum. 12, 237.

    Article  CAS  Google Scholar 

  • Franke, R. (1984). Theoretical Drug Design Methods, Elsevier, Amsterdam.

    Google Scholar 

  • Frierson, M.R., Klopman, G. and Rosenkranz, H.S. (1986). Environ. Mutagen. 8, 283.

    Article  CAS  Google Scholar 

  • Gabanyi, Z., Surjan, P. and Naray-Szabo, G. (1982). Eur. J. Med. Chem. 17, 307.

    CAS  Google Scholar 

  • Harrison, P.J. (1968). Appl. Stat. 17, 226.

    Article  Google Scholar 

  • Hodes, L. (1989). J. Chem. Inform. Comput. Sci. 29, 66.

    Article  CAS  Google Scholar 

  • Jarvis, R.A. and Patrick, E.A. (1973). IEEE Trans. Comput. C-22, 1025.

    Article  Google Scholar 

  • Johnson, M.A. (1989). J. Math. Chem. 3, 117.

    Article  CAS  Google Scholar 

  • Johnson, M.A. (1990). Similarity-based methods for predicting chemical and biological properties: a brief overview from a statistical perspective. In, Bawden, D. and Mitchell, E.M. (Eds.) Chemical Information Systems. Beyond the Structure Diagram, Ellis Horwood, Chichester, pp. 149–159.

    Google Scholar 

  • Johnson, M.A., Lajiness, M and Maggiora, G. (1989). Molecular similarity: a basis for designing drug screening programs. In, QSAR: Quantitative Structure-Activity Relationships in Drug Design; Progress in Clinical Biological Research Series 291, Alan R. Liss, Inc. pp. 167–171.

    Google Scholar 

  • Johnson, M.A. and Maggiora, G.M. (Eds.) (1990). Concepts and Applications of Molecular Similarity, Wiley, New York.

    Google Scholar 

  • Kissman, H.M. and Wexler, P. (1985). J. Chem. Inform. Comput. Sci. 25, 212.

    Article  CAS  Google Scholar 

  • Klopman, G. and Raychaudhury, C. (1990). J. Chem. Inform. Comput. Sci. 30, 12.

    Article  CAS  Google Scholar 

  • Lajiness, M.S., Johnson, M.A. and Maggiora, G.M. (1989). Prog. Clin. Biol. Res. 291, 173.

    CAS  Google Scholar 

  • Lipscombe, K.J., Lynch, M.F. and Willett, P. (1989). Ann. Rev. Inform. Sci. Technol. 24, 189.

    Google Scholar 

  • Lyman, W.J., Reehl, W.F. and Rosenblatt, D.H. (Eds.) (1981). Handbook of Chemical Property Estimation, McGraw-Hill, New York.

    Google Scholar 

  • Martin, Y.C., Bures, M.G. and Willett, P. (1990). Searching databases of three-dimensional structures. In, Lipkowitz, K.B. and Boyd, D.B. (Eds.). Reviews in Computational Chemistry, VCH, New York, pp. 213–263.

    Chapter  Google Scholar 

  • Morgan, H.L. (1965). J. Chem. Docum. 5, 107.

    Article  CAS  Google Scholar 

  • Murtagh, F. (1983). Comput. J. 26, 354.

    Article  Google Scholar 

  • Norager, O. (1988). ECDIN, Environmental Chemicals Data and Information Network. In, Warr, W.A. (Ed.). Chemical Structures, Springer-Verlag, Berlin Heidelberg, pp. 195–209.

    Chapter  Google Scholar 

  • Ormerod, A., Willett, P. and Bawden, D. (1989). Quant. Struct.-Activ. Relat. 8, 115.

    Article  CAS  Google Scholar 

  • Pepperrell, C.A., Poirrette, A.R., Willett, P. and Taylor, R. (1991). Development of an atom mapping procedure for similarity searching in databases of three-dimensional chemical structures. Submitted for publication.

    Google Scholar 

  • Pepperrell, C.A. and Willett, P. (1991). Techniques for the calculation of three-dimensional structural similarity using inter-atomic distances. Submitted for publication.

    Google Scholar 

  • Randic, M. and Wilkins, C.L. (1979). J. Chem. Inform. Comput. Sci. 19, 31.

    Article  CAS  Google Scholar 

  • Rosenkranz, H.S. and Klopman, G. (1988). Toxicol. Indust. Health 4, 533.

    CAS  Google Scholar 

  • Rubin, V. and Willett, P. (1983). Anal. Chim. Acta 151, 161.

    Article  CAS  Google Scholar 

  • Sneath, P.H.A. and Sokal, R.R. (1973). Numerical Taxonomy, Freeman, San Francisco.

    Google Scholar 

  • Tarjan, R.E. (1977). Amer. Chem. Soc. Symp. Ser. 46, 1.

    CAS  Google Scholar 

  • Tosato, M.L., Marchini, S., Passerini, L., Pino, A., Eriksson, L., Lindgren, F., Hellberg, S., Jonsson, J., Sjostrom, M., Skagerberg, B. and Wold, S. (1990). Env. Toxicol. Chem. 9, 265.

    Article  CAS  Google Scholar 

  • Warr, W.E. (Ed.) (1988). Chemical Structures. The International Language of Chemistry, Springer, Berlin.

    Google Scholar 

  • Weininger, D. (1988). J. Chem. Inform. Comput. Sci. 28, 31.

    Article  CAS  Google Scholar 

  • Wilkins, C.L. and Randic, M. (1980). Theor. Chim. Acta 58, 45.

    Article  CAS  Google Scholar 

  • Willett, P. (1982). Anal. Chim. Acta 136, 29.

    Article  CAS  Google Scholar 

  • Willett, P. (1983). J. Chem. Inf. Comput. Sci. 23, 22.

    Article  CAS  Google Scholar 

  • Willett, P. (1984). J. Chem. Inform. Comput. Sci. 24, 29.

    Article  CAS  Google Scholar 

  • Willett, P. (1987). Similarity and Clustering in Chemical Information Systems, Research Studies Press, Letchworth.

    Google Scholar 

  • Willett, P. (1990). Algorithms for the calculation of similarity in chemical structure databases. In, Johnson, M.A. and Maggiora, G.M. (Eds.) Concepts and Applications of Molecular Similarity, Wiley, New York, pp. 43–63.

    Google Scholar 

  • Willett, P. (1991). Three-Dimensional Chemical Structure Handling, Research Studies Press, Taunton.

    Google Scholar 

  • Willett, P. and Downs G.M. (1989). Clustering of chemical structure databases. An investigation for the EC Joint Research Centre, Department of Information Studies, University of Sheffield.

    Google Scholar 

  • Willett, P. and Winterman, V. (1986). Quant. Struct.-Activ. Relat. 5, 18.

    Article  CAS  Google Scholar 

  • Willett, P., Winterman, V. and Bawden, D. (1986a). J. Chem. Inform. Comput. Sci. 26, 36.

    Article  CAS  Google Scholar 

  • Willett, P., Winterman, V. and Bawden, D. (1986b). J. Chem. Inform. Comput. Sci. 26, 109.

    Article  CAS  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1991 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Downs, G.M., Willett, P. (1991). The Use of Similarity and Clustering Techniques for the Prediction of Molecular Properties. In: Devillers, J., Karcher, W. (eds) Applied Multivariate Analysis in SAR and Environmental Studies. Eurocourses: Chemical and Environmental Science, vol 2. Springer, Dordrecht. https://doi.org/10.1007/978-94-011-3198-8_8

Download citation

  • DOI: https://doi.org/10.1007/978-94-011-3198-8_8

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-010-5410-2

  • Online ISBN: 978-94-011-3198-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics