Biochemical Knowledge Discovery Using Inductive Logic Programming

Muggleton, Stephen; Srinivasan, Ashwin; King, R. D.; Sternberg, M. J. E.

doi:10.1007/3-540-49292-5_29

Stephen Muggleton³,
Ashwin Srinivasan⁴,
R. D. King⁵ &
…
M. J. E. Sternberg⁶

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1532))

Included in the following conference series:

International Conference on Discovery Science

584 Accesses
10 Citations

Abstract

Machine Learning algorithms are being increasingly used for knowledge discovery tasks. Approaches can be broadly divided by distinguishing discovery of procedural from that of declarative knowledge. Client requirements determine which of these is appropriate. This paper discusses an experimental application of machine learning in an area related to drug design. The bottleneck here is in finding appropriate constraints to reduce the large number of candidate molecules to be synthesised and tested. Such constraints can be viewed as declarative specifications of the structuralel ements necessary for high medicinal activity and low toxicity. The first-order representation used within Inductive Logic Programming (ILP) provides an appropriate description language for such constraints. Within this application area knowledge accreditation requires not only a demonstration of predictive accuracy but also, and crucially, a certification of novel insight into the structural chemistry. This paper describes an experiment in which the ILP system Progolw as used to obtain structural constraints associated with mutagenicity of molecules. In doing so Progol found a new indicator of mutagenicity within a subset of previously published data. This subset was already known not to be amenable to statistical regression, though its complement was adequately explained by a linear model. According to the combined accuracy/explanation criterion provided in this paper, on both subsets comparative trials show that Progol’s structurally-oriented hypotheses are preferable to those of other machine learning algorithms

The results in this paper are published separately in [7,16]

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

J. Black. Drugs from emasculated hormones: the principle of syntopic antagonism. Bioscience Reports, 9(3), 1989. Published in Les Prix Nobel, 1988. Printed in Sweden by Nostedts Tryckeri, Stockholm.
Google Scholar
L. Breiman, J.H. Friedman, R.A. Olshen, and C.J. Stone. Classification and Regression Trees. Wadsworth, Belmont, 1984.
Google Scholar
W. Buntine. Ind package of machine learning algorithms. Technical Report 244-17, Research Institute for Advanced Computer Science, NASA Ames Research Center, Moffett Field, CA 94035, 1992.
Google Scholar
A.K. Debnath, R.L Lopez de Compadre, G. Debnath, A.J. Schusterman, and C. Hansch. Structure-Activity Relationship of Mutagenic Aromatic and Heteroaromatic Nitro compounds. Correlation with molecular orbital energies and hydrophobicity. Journal of Medicinal Chemistry, 34(2):786–797, 1991.
Article Google Scholar
P. Finn, S. Muggleton, D. Page, and A. Srinivasan. Pharmacophore Discovery using the Inductive Logic Programming system Progol Machine Learning, 30: 241–271, 1998.
Article Google Scholar
C.W. Gear. Numerical Initial Value Problems is Ordinary Differential Equations. Prentice-Hall, Edgewood Cliffs, NJ, 1971.
Google Scholar
R.D. King, S.H. Muggleton, A. Srinivasan, and M.J.E. Sternberg. Structure-activity relationships derived by machine learning: The use of atoms and their bond connectivities to predict mutagenicity by inductive logic programming. Proc. of the National Academy of Sciences, 93:438–442, 1996.
Article Google Scholar
J. McCarthy. Programs with commonsense. In Mechanisation of thought processes (v1). Her Majesty’s Stationery Office, London, 1959. Reprinted with an additional section in Semantic Information Processing.
Google Scholar
R.S. Michalski. A theory and methodology of inductive learning. In R. Michalski, J. Carbonnel, and T. Mitchell, editors, Machine Learning: An Artificial Intelligence Approach, pages 83–134. Tioga, Palo Alto, CA, 1983.
Google Scholar
D. Michie. The superarticulacy phenomenon in the context of software manufacture. Proceedings of the Royal Society of London, A 405:185–212, 1986. Reprinted in: The Foundations of Artificial Intelligence: a Sourcebook (eds. D. Partridge and Y. Wilks), Cambridge University Press, 1990.
Google Scholar
D. Michie, D.J. Spiegelhalter, and C.C. Taylor, editors. Machine Learning, Neural and Statistical classification. Ellis-Horwood, New York, 1994.
MATH Google Scholar
S. Muggleton. Inverse Entailment and Progol. New Gen. Comput., 13:245–286, 1995.
Article Google Scholar
S. Muggleton and L. De Raedt. Inductive logic programming: Theory and methods. Journal of Logic Programming, 19,20:629–679, 1994.
Article MathSciNet Google Scholar
A. J. Owens and D. L. Filkin. Efficient training of the back-propagation network by solving a system of stiff ordinary differential equations. In Proceedings IEEE/INNS International Joint Conference of Neural Networks, pages 381–386, Washington DC, 1989.
Google Scholar
G. Ryle. The Concept of Mind. Hutchinson, 1949.
Google Scholar
A. Srinivasan, S.H. Muggleton, R.D. King, and M.J.E. Sternberg. Theories for mutagenicity: a study of first-order and feature based induction. Artificial Intelligence, 85:277–299, 1996.
Article Google Scholar
D. Villemin, D. Cherqaoui, and J.M. Cense. Neural network studies: quantitative structure-activity relationship of mutagenic aromatic nitro compounds. J. Chim. Phys, 90:1505–1519, 1993.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of York, USA
Stephen Muggleton
Computing Laboratory, University of Oxford, UK
Ashwin Srinivasan
Department of Computer Science, The University of Wales Aberystwyth, USA
R. D. King
Biomolecular Modelling Laboratory, Imperial Cancer Research Fund, USA
M. J. E. Sternberg

Authors

Stephen Muggleton
View author publications
You can also search for this author in PubMed Google Scholar
Ashwin Srinivasan
View author publications
You can also search for this author in PubMed Google Scholar
R. D. King
View author publications
You can also search for this author in PubMed Google Scholar
M. J. E. Sternberg
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Informatics, Kyushu University, Fukuoka, 812-8581, USA
Setsuo Arikawa
Institute of Scientific and Industrial Research Devision of Intelligent Systems Science, Osaka University, 8-1 Mihogaoka, Ibaraki, Osaka, 567-0047, Japan
Hiroshi Motoda

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Muggleton, S., Srinivasan, A., King, R.D., Sternberg, M.J.E. (1998). Biochemical Knowledge Discovery Using Inductive Logic Programming. In: Arikawa, S., Motoda, H. (eds) Discovey Science. DS 1998. Lecture Notes in Computer Science(), vol 1532. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49292-5_29

Download citation

DOI: https://doi.org/10.1007/3-540-49292-5_29
Published: 14 January 2003
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65390-5
Online ISBN: 978-3-540-49292-4
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics