Error estimators for pruning regression trees

  • Luís Torgo
Decision Trees
Part of the Lecture Notes in Computer Science book series (LNCS, volume 1398)

Abstract

This paper presents a comparative study of several methods for estimating the true error of tree-structured regression models. We evaluate these methods in the context of regression tree pruning, focusing on problems where large samples of data are available. We present two novel variants of existing estimation methods. The methods compared follow different approaches to the estimation problem, and we evaluate them experimentally in twelve domains. The goal of this evaluation is to characterise how well each method selects the best possible tree among the alternative trees considered during pruning. The results show that certain estimators lead to very poor decisions in some domains. Our proposed variant of the holdout method obtained the best results in the experimental comparisons.
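The selection task described in the abstract can be illustrated with a minimal sketch. It assumes a plain holdout estimate (mean squared error on data withheld from training), not the paper's proposed variant, whose details are not given here. The names `holdout_mse`, `select_by_holdout`, `full_tree` and `pruned_tree` are hypothetical, and the two toy functions merely stand in for alternative trees produced during pruning.

```python
from typing import Callable, List, Sequence, Tuple

# A candidate tree is modelled as a function from one input value to a prediction.
Model = Callable[[float], float]


def holdout_mse(model: Model, xs: Sequence[float], ys: Sequence[float]) -> float:
    """Mean squared error of `model` on a holdout sample not used for training."""
    return sum((model(x) - y) ** 2 for x, y in zip(xs, ys)) / len(ys)


def select_by_holdout(
    candidates: List[Model], xs: Sequence[float], ys: Sequence[float]
) -> Tuple[int, List[float]]:
    """Return the index of the candidate with the lowest holdout error, plus all errors."""
    errors = [holdout_mse(m, xs, ys) for m in candidates]
    best = min(range(len(candidates)), key=errors.__getitem__)
    return best, errors


# Toy stand-ins for two trees from a pruning sequence (assumed, for illustration only).
def full_tree(x: float) -> float:
    return 1.0 if x < 0.5 else 3.0  # two leaves


def pruned_tree(x: float) -> float:
    return 2.0  # pruned down to a single leaf


holdout_x = [0.1, 0.4, 0.6, 0.9]
holdout_y = [1.0, 1.2, 3.0, 2.8]

best_index, errors = select_by_holdout([full_tree, pruned_tree], holdout_x, holdout_y)
print(best_index, errors)
```

In this toy setting the two-leaf tree has the lower holdout error and is selected. An estimator that ranked the candidates differently would lead to a worse pruning decision, which is the kind of failure the comparison is designed to detect.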

Keywords

Regression Tree Size Training Sample Correct Ranking Error Estimation Method Training Pool 

Copyright information

© Springer-Verlag Berlin Heidelberg 1998

Authors and Affiliations

  • Luís Torgo (1)
  1. LIACC-FEP, Porto, Portugal