Learning Ontology-Aware Classifiers

Zhang, Jun; Caragea, Doina; Honavar, Vasant

doi:10.1007/11563983_26

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 3735)

Abstract

Many practical applications of machine learning in data-driven scientific discovery call for exploring data from multiple points of view, each corresponding to an explicitly specified ontology. This paper formalizes a class of problems of learning from ontologies and data, and explores the design space of algorithms for learning classifiers from attribute value taxonomies (AVTs) and data. We introduce the notions of AVT-extended data sources and partially specified data, and propose a general framework for learning classifiers from such data sources. Two instantiations of this framework are presented: an AVT-based Decision Tree classifier and an AVT-based Naïve Bayes classifier. Experimental results show that the resulting algorithms learn robust, high-accuracy classifiers with substantially more compact representations than those obtained by standard learners.
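
To make the terminology concrete, the following is a minimal Python sketch, assuming that an AVT is a tree over the values of a single attribute and that a partially specified value is one recorded at an internal node of that tree rather than at a leaf. The taxonomy, the attribute, and every function name here are hypothetical illustrations. The count aggregation shown is only one plausible building block for a learner of this kind, not the algorithm from the paper.

```python
from collections import Counter

# Hypothetical AVT for a "student status" attribute, stored as child -> parent.
# "Student" is the root; nodes with no children are the leaves.
PARENT = {
    "Undergraduate": "Student",
    "Graduate": "Student",
    "Freshman": "Undergraduate",
    "Senior": "Undergraduate",
    "Masters": "Graduate",
    "PhD": "Graduate",
}


def children(node):
    """Return the direct children of a node in the taxonomy."""
    return [child for child, parent in PARENT.items() if parent == node]


def is_partially_specified(value):
    """Assumption: a value is partially specified if it is an internal node."""
    return len(children(value)) > 0


def ancestors(node):
    """Yield the ancestors of a node, from its parent up to the root."""
    while node in PARENT:
        node = PARENT[node]
        yield node


def count_with_ancestors(values):
    """Count each observed value at its own node and at every ancestor."""
    counts = Counter()
    for value in values:
        counts[value] += 1
        for ancestor in ancestors(value):
            counts[ancestor] += 1
    return counts


# Three fully specified observations and one partially specified ("Graduate").
observed = ["Freshman", "Senior", "PhD", "Graduate"]
counts = count_with_ancestors(observed)
```

With counts available at every node, a learner could in principle estimate statistics at any level of abstraction in the taxonomy; how the paper's classifiers actually use such counts is not stated in the abstract.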

Author information

Authors and Affiliations

Artificial Intelligence Research Laboratory, Department of Computer Science, Iowa State University, Ames, Iowa, 50011-1040, USA
Jun Zhang, Doina Caragea & Vasant Honavar

Editor information

Editors and Affiliations

School of Computer Science & Engineering, The University of New South Wales, Sydney, Australia
Achim Hoffmann
Institute of Scientific and Industrial Research, Osaka University, 8-1 Mihogaoka, 567-0047, Ibaraki, Osaka, Japan
Hiroshi Motoda
Max Planck Institute for Computer Science, Saarbrücken, Germany
Tobias Scheffer

About this paper

Cite this paper

Zhang, J., Caragea, D., Honavar, V. (2005). Learning Ontology-Aware Classifiers. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds) Discovery Science. DS 2005. Lecture Notes in Computer Science, vol 3735. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11563983_26

DOI: https://doi.org/10.1007/11563983_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29230-2
Online ISBN: 978-3-540-31698-5
eBook Packages: Computer Science, Computer Science (R0)
