Abstract
In many application domains, there is a need for learning algorithms that generate accurate as well as comprehensible classifiers. In this paper, we present TRIPPER – a rule induction algorithm that extends RIPPER, a widely used rule-learning algorithm. TRIPPER exploits knowledge in the form of taxonomies over the values of features used to describe data. We compare the performance of TRIPPER with that of RIPPER on benchmark datasets from the Reuters 21578 corpus using WordNet (a human-generated taxonomy) to guide rule induction by TRIPPER. Our experiments show that the rules generated by TRIPPER are generally more comprehensible and compact and in the large majority of cases at least as accurate as those generated by RIPPER.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Cohen, W.W.: Fast effective rule induction. In: Proceedings of International Conference on Machine Learning, Lake Tahoe, CA (1995)
Kang, D.-K., Silvescu, A., Zhang, J., Honavar, V.: Generation of Attribute Value Taxonomies from Data for Data-Driven Construction of Accurate and Compact Classifiers. In: Proceedings of the 4th IEEE International Conference on Data Mining, Brighton, UK (2004)
Fellbaum, C.: WordNet, An Electronic Lexical Database. The MIT Press, Cambridge (1998)
Fürnkranz, J., Widmer, G.: Incremental reduced error pruning. In: Proceedings of International Conference on Machine Learning, New Brunswick, NJ (1994)
Quinlan, J.R.: MDL and categorical theories. In: Proceedings of International Conference on Machine Learning, Lake Tahoe, CA (1995)
McCallum, A., Nigam, K.: A comparison of event models for naive bayes text classification. In: AAAI-1998 Workshop on Learning for Text Categorization, pp. 3–5 (1998)
Michalski, R.S.: Attributional Calculus: A Logic and Representation Language for Natural Induction. In: Reports of the Machine Learning and Inference Laboratory, MLI 04-2, George Mason University, Fairfax, VA (2004)
Zhang, J., Honavar, V.: Learning decision tree classifiers from attribute value taxonomies and partially specified data. In: Proceedings of International Conference on Machine Learning, Washington, DC (2003)
Zhang, J., Honavar, V.: AVT-NBL 2004: An algorithm for learning compact and accurate naive bayes classifiers from feature value taxonomies and data. In: Proceedings of the Fourth IEEE International Conference on Data Mining, Brighton, UK (2004)
Apte, C., Damerau, F., Weiss Sholom, M.: Towards language independent automated learning of text categorization models. In: SIGIR 1994, pp. 23–30. Springer, New York (1994)
Vasile, F., Silvescu, A., Kang, D.-K., Honavar, V.: TRIPPER: Rule learning using taxonomies, Tehnical Report ISU-CS-TR, Department of Computer Science, Iowa State University (January 2006), Publicly available at, http://www.cs.iastate.edu/~flavian/tripper_long.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vasile, F., Silvescu, A., Kang, DK., Honavar, V. (2006). TRIPPER: Rule Learning Using Taxonomies. In: Ng, WK., Kitsuregawa, M., Li, J., Chang, K. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2006. Lecture Notes in Computer Science(), vol 3918. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11731139_9
Download citation
DOI: https://doi.org/10.1007/11731139_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33206-0
Online ISBN: 978-3-540-33207-7
eBook Packages: Computer ScienceComputer Science (R0)