The problem of missing values in decision tree grafting

Webb, Geoffrey I.

doi:10.1007/BFb0095059

Geoffrey I. Webb¹

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1502))

Included in the following conference series:

Australian Joint Conference on Artificial Intelligence

Abstract

Decision tree grafting adds nodes to inferred decision trees. Previous research has demonstrated that appropriate grafting techniques can improve predictive accuracy across a wide cross-selection of domains. However, previous decision tree grafting systems are demonstrated to have a serious deficiency for some data sets containing missing values. This problem arises due to the method for handling missing values employed by C4.5, in which the grafting systems have been embedded. This paper provides an explanation of and solution to the problem. Experimental evidence is presented of the efficacy of this solution.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Ali, K., Brunk, C., & Pazzani, M. (1994). On learning multiple descriptions of a concept. In Proceedings of Tools with Artificial Intelligence, pp. 476–483 New Orleans, LA.
Google Scholar
Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140.
MATH MathSciNet Google Scholar
Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and Regression Trees. Wadsworth International, Belmont, Ca.
MATH Google Scholar
Dietterich, T. G., & Bakiri, G. (1994). Solving multiclass learning problems via error-correcting output codes. Journal of Artificial Intelligence Research, 2, 263–286.
Google Scholar
Freund, Y., & Schapire, R. E. (1995). A decision-theoretic generalization of online learning and an application to boosting. In Proceedings of the Second European Conference on Machine Learning, pp. 23–37. Springer-Verlag.
Google Scholar
Kwok, S. W., & Carter, C. (1990). Multiple decision tress. In Shachter, R. D., Levitt, T. S., Kanal, L. N., & Lemmer, J. F. (Eds.), Uncertainty in Artificial Intelligence 4, pp. 327–335. North Holland, Amsterdam.
Google Scholar
Merz, C. J., & Murphy, P. M. (1998). UCI repository of machine learning databases. [Machine-readable data repository]. University of California, Department of Information and Computer Science, Irvine, CA.
Google Scholar
Niblett, T., & Bratko, I. (1986). Learning decision rules in noisy domains. In Bramer, M. A. (Ed.), Research and Development in Expert Systems III, pp. 25–34. Cambridge University Press, Cambridge.
Google Scholar
Nock, R., & Gascuel, O. (1995). On learning decision committees. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 413–420 Taho City, Ca. Morgan Kaufmann.
Google Scholar
Oliver, J. J., & Hand, D. J. (1995). On pruning and averaging decision trees. In Proceedings of the Twelfth International Conference on Machine Learning, pp. 430–437. Taho City, Ca. Morgan Kaufmann.
Google Scholar
Quinlan, J. R. (1993). C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo, CA.
Google Scholar
Schapire, R. E. (1990). The strength of weak learnability. Machine Learning, 5, 197–227.
Google Scholar
Webb, G. I. (1996). Further experimental evidence against the utility of Occam’s razor. Journal of Artificial Intelligence Research, 4, 397–417.
MATH MathSciNet Google Scholar
Webb, G. I. (1997). Decision tree grafting. In IJCAI-97: Fifteenth International Joint Conference on Artificial Intelligence, pp. 846–851 Nagoya, Japan. Morgan Kaufmann.
Google Scholar
Wolpert, D. H. (1992). Stacked generalization. Neural Networks, 5, 241–259.
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computing and Mathematics, Deakin University, 3217, Geelong, Vic, Austrlia
Geoffrey I. Webb

Authors

Geoffrey I. Webb
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Grigoris Antoniou John Slaney

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Webb, G.I. (1998). The problem of missing values in decision tree grafting. In: Antoniou, G., Slaney, J. (eds) Advanced Topics in Artificial Intelligence. AI 1998. Lecture Notes in Computer Science, vol 1502. Springer, Berlin, Heidelberg . https://doi.org/10.1007/BFb0095059

Download citation

DOI: https://doi.org/10.1007/BFb0095059
Published: 19 October 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65138-3
Online ISBN: 978-3-540-49561-1
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics