Skip to main content

Learning Ontology-Aware Classifiers

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3735))

Abstract

Many practical applications of machine learning in data-driven scientific discovery commonly call for the exploration of data from multiple points of view that correspond to explicitly specified ontologies. This paper formalizes a class of problems of learning from ontology and data, and explores the design space of learning classifiers from attribute value taxonomies (AVTs) and data. We introduce the notion of AVT-extended data sources and partially specified data. We propose a general framework for learning classifiers from such data sources. Two instantiations of this framework, AVT-based Decision Tree classifier and AVT-based Naïve Bayes classifier are presented. Experimental results show that the resulting algorithms are able to learn robust high accuracy classifiers with substantially more compact representations than those obtained by standard learners.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Almuallim H., Akiba, Y., Kaneda, S.: On Handling Tree-Structured Attributes. In: Proceedings of the Twelfth International Conference on Machine Learning (1995)

    Google Scholar 

  2. Akaike, H.: A New Look at Statistical Model Identification. IEEE Trans. on Automatic Control AU-19, 716–722 (1974)

    Google Scholar 

  3. Ashburner, M., et al.: Gene ontology: tool for the unification of biology. The Gene Ontology Consortium. Nat Genet. 25(1) (2000)

    Google Scholar 

  4. Bergadano, F., Giordana, A.: Guiding Induction with Domain Theories. In: Machine Learning - An Artificial Intelligence Approach, vol. 3, pp. 474–492. Morgan Kaufmann, San Francisco (1990)

    Google Scholar 

  5. Berners-Lee, T., Hendler, J., Lassila, O.: The semantic web. Scientific American (May 2001)

    Google Scholar 

  6. Blum, A., Mitchell, T.: Combining Labeled and Unlabeled Data with Co-Training. In: Annual Conference on Computational Learning Theory (COLT 1998) (1998)

    Google Scholar 

  7. Caragea, D., Silvescu, A., Honavar, V.: A Framework for Learning from Distributed Data Using Sufficient Statistics and its Application to Learning Decision Trees. International Journal of Hybrid Intelligent Systems 1 (2004)

    Google Scholar 

  8. Caragea, D., Pathak, J., Honavar, V.: Learning Classifiers from Semantically Heterogeneous Data. In: 3rd International Conference on Ontologies, Databases, and Applications of Semantics for Large Scale Information Systems (2004)

    Google Scholar 

  9. Clare, A., King, R.: Knowledge discovery in multi-label phenotype data. In: Siebes, A., De Raedt, L. (eds.) PKDD 2001. LNCS (LNAI), vol. 2168, p. 42. Springer, Heidelberg (2001)

    Chapter  Google Scholar 

  10. Cohen, W.: Learning Trees and Rules with Set-valued Features. In: Proceedings of the Thirteenth National Conference on Artificial Intelligence. AAAI Press, Menlo Park (1996)

    Google Scholar 

  11. Dempster, A., Laird, N., Rubin, D.: Maximum Likelihood from Incomplete Data via the EM Algorithm. Journal of the Royal Statistical Society, Series B 39(1), 1–38 (1977)

    MATH  MathSciNet  Google Scholar 

  12. Friedman, N., Geiger, D., Goldszmidt, M.: Bayesian Network Classifiers. Machine Learning 29 (1997)

    Google Scholar 

  13. Han, J., Fu, Y.: Exploration of the Power of Attribute-Oriented Induction in Data Mining. In: Fayyad, U.M., et al. (eds.) Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press (1996)

    Google Scholar 

  14. Haussler, D.: Quantifying Inductive Bias: AI Learning Algorithms and Valiant’s Learning Framework. Artificial Intelligence 36 (1988)

    Google Scholar 

  15. Kang, D., Silvescu, A., Zhang, J., Honavar, V.: Generation of Attribute Value Taxonomies from Data for Data-Driven Construction of Accurate and Compact Classifiers. To appear: Proceedings of The Fourth IEEE International Conference on Data Mining (2004)

    Google Scholar 

  16. Kohavi, R., Provost, P.: Applications of Data Mining to Electronic Commerce. Data Mining and Knowledge Discovery 5 (2001)

    Google Scholar 

  17. Koller, D., Sahami, M.: Hierarchically classifying documents using very few words. In: Proceedings of the 14th Int’l Conference on Machine Learning (1997)

    Google Scholar 

  18. McClean, S., Scotney, B., Shapcott, M.: Aggregation of Imprecise and Uncertain Information in Databases. IEEE Trans. on Knowledge and Data Engineering 13(6), 902–912 (2001)

    Article  Google Scholar 

  19. Pazzani, M., Kibler, D.: The role of prior knowledge in inductive learning. Machine Learning 9, 54–97 (1992)

    Google Scholar 

  20. Quinlan, J.R.: C4.5: Programs for machine learning. Morgan Kaufmann, San Mateo (1992)

    Google Scholar 

  21. Rissanen, J.: Modeling by shortest data description. Automatica 14 (1978)

    Google Scholar 

  22. Sowa, J.: Knowledge Representation: Logical, Philosophical, and Computational Foundations. PWS Publishing, New York (1999)

    Google Scholar 

  23. Taylor, M., Stoffel, K., Hendler, J.: Ontology-based Induction of High Level Classification Rules. In: SIGMOD Data Mining and Knowledge Discovery workshop proceedings. Tuscon, Arizona (1997)

    Google Scholar 

  24. Towell, G., Shavlik, J.: Knowledge-based Artificial Neural Networks. Artificial Intelligence 70 (1994)

    Google Scholar 

  25. Undercoffer, J., et al.: A Target Centric Ontology for Intrusion Detection: Using DAML+OIL to Classify Intrusive Behaviors. To appear, Knowledge Engineering Review - Special Issue on Ontologies for Distributed Systems, Cambridge University Press (2004)

    Google Scholar 

  26. Zhang, J., Silvescu, A., Honavar, V.G.: Ontology-driven induction of decision trees at multiple levels of abstraction. In: Koenig, S., Holte, R.C. (eds.) SARA 2002. LNCS (LNAI), vol. 2371, p. 316. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  27. Zhang, J., Honavar, V.: Learning Decision Tree Classifiers from Attribute Value Taxonomies and Partially Specified Instances. In: Proceedings of the 20th Int’l Conference on Machine Learning (2003)

    Google Scholar 

  28. Zhang, J., Honavar, V.: AVT-NBL: An Algorithm for Learning Compact and Accurate Naïve Bayes Classifiers from Attribute Value Taxonomies and Data. In: Proceedings of the Fourth IEEE International Conference on Data Mining (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, J., Caragea, D., Honavar, V. (2005). Learning Ontology-Aware Classifiers. In: Hoffmann, A., Motoda, H., Scheffer, T. (eds) Discovery Science. DS 2005. Lecture Notes in Computer Science(), vol 3735. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11563983_26

Download citation

  • DOI: https://doi.org/10.1007/11563983_26

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29230-2

  • Online ISBN: 978-3-540-31698-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics