Summary
A new and general approach to forming Structure Activity Relationships (SARs) is described. This is based on representing chemical structure by atoms and their bond connectivities in combination with the Inductive Logic Programming (ILP) algorithm Progol. Existing SAR methods describe chemical structure using attributes which are general properties of an object. It is not possible to map directly chemical structure to attribute-based descriptions, as such descriptions have no internal organisation. A more natural and general way to describe chemical structure is to use a relational description, where the internal construction of the description maps that of the object described. Our atom and bond connectivities representation is a relational description. ILP algorithms can form SARs with relational descriptions. We have tested the relational approach by investigating the SAR of 230 aromatic and heteroaromatic nitro compounds. These compounds had been split previously into two sub-sets, 188 compounds that were amenable to regression, and 42 that were not. For the 188 compounds, a SAR was found that was as accurate as the best statistical or neural network generated SARs. The Progol SAR has the advantages that it did not need the use of any indicator variables hand-crafted by an expert, and the generated rules were easily comprehensible. For the 42 compounds, Progol formed a SAR that was significantly (P < 0.025) more accurate than linear regression, quadratic regression, and back-propagation. This SAR is based on a new automatically generated structural alert for mutagenicity.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hansch, C., Maloney, P.P. Fujita, T. and Muir, R.M. (1962) Correlation of biological activity of phenoxyacetic acids with Hammett substitution constants and partition coefficients. Nature, 194 178–180.
Frank, I. E. & Friedman, J. H. (1993) A statistical view of some chemometrics regression tools. Technometrics 35, 109–135.
Hirst, J. D., King, R. D. & Sternberg, M. J. E. (1994) Quantatitive Structure-Activity Relationships: Neural Networks and Inductive Logic Programming compared against statistical methods: I. The Inhibition of Dihydrofolate Reductase by pyrimidines J. Comput.-Aided Mol. Des. 8, 405–420.
Hirst, J. D., King, R. D. & Sternberg, M. J. E. (1994) Quantatitive Structure-Activity Relationships: Neural Networks and Inductive Logic Programming compared against statistical methods: II. The Inhibition of Dihydrofolate Reductase by triazines. Comput.-Aided Mol. Des. 8, 405–420.
King, R. D., J. D. Hirst, and M. J. E. (1994) A comparison of artificial intelligence methods for modelling pharmaceutical QSARs. Sternberg, Applied Artificial Intelligence. 9, 213–234.
Debnath, A. K., Lopez de Compadre, R. L., Debnath, G., Shusterman, A. J. & Hansch, C. (1991) Structure-Activity relationship of mutagenic aromatic and heteroaromatic nitro compounds. correlation with molecular orbital energies and hydrophobicity J. Med. Chem. 34, 786–797.
Ramsden, C. (1990) Comprehensive Medicinal Chemistry 4 (Pergamon Press, Oxford).
Trinajstic, N. (1983) Chemical Graph Theory. Chemical Graph Theory (CRC Press, Boca Raton, FL.)
Klopman, G. (1984) Artificial Intelligence approach to structure-activity studies: computer automated structure evaluation of biological activity of organic molecules J. Am. Chem. Soc. 106, 7315–7321.
Ormerod, A., Willet, P. & Bawden, D. (1989) Comparison of fragment weighting schemes for substructural analysis. Quant. struct.-Act. Relat. 8, 115–129.
Hopfinger, A. J. (1980) A QSAR Investigation of Dihydrofolate Reductase inhibition by baker triazines based upon molecular shape analysis. J. Am. Chem. Soc. 102, 7196–7206.
Muggleton, S. (1991) Inductive Logic Programming. New Gen. Computing 8, 295–318.
Lavrac, N. & Dzeroski, S. (1994) Inductive Logic Programming Techniques and Applications (Ellis Horwood, London).
DeLong, H. (1970) A Profile of Mathematical Logic (Addison-Wesley, Reading, MA).
King, R. D., Muggleton, S. H., Srinivasan, A. & Sternberg, M. J. E. (1996) Structure-activity relationships derived by machine learning: The use of atoms and their bond connectivities to predict mutagenicity by inductive logic programming. Proc. Natl. Acad. Sci . USA. 93, 438–442.
Muggleton, S. H., (1995) Mode directed inverse resolution. New Generation Computing 13, 245–286.
Villemin, D., Cherqaoui, D. & Cense, J. M. (1993) Neural networks studies: quantitative structure-activity relationship of mutagenic aromatic nitro compounds. J. Chim. Phys. 90, 1505–1519.
Wallace, C. S. & Freeman, P. R. (1987) Estimation and Inference by compact coding. J. R. Statist. Soc. B. 49, 195–209.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1998 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
King, R.D., Sternberg, M.J.E., Muggleton, S.H., Srinivasan, A. (1998). Recent Developments in Applying Machine Learning to Drug Design. In: Codding, P.W. (eds) Structure-Based Drug Design. NATO ASI Series, vol 352. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-9028-0_14
Download citation
DOI: https://doi.org/10.1007/978-94-015-9028-0_14
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-5078-6
Online ISBN: 978-94-015-9028-0
eBook Packages: Springer Book Archive