Abstract
This paper provides a statistical testing approach to the validation of the pruning process in regression trees construction. In particular, the testing procedure, based on the F distribution, is applied to the CART sequence of pruned subtrees providing a single tree prediction rule which is statistically reliable and might not coincide with any tree in the sequence.
The present paper is financially supported by MURST funds
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Breiman L., Friedman J.H., Olshen R.A., Stone C.J. (1984) Classification and Regression Trees, Wadsworth, Belmont CA.
Breiman L. (1996) Bagging Predictors, Machine Learning, 24, 123–140.
Cappelli, C., Siciliano, R., (1998), Strategies for Choosing the best Decision Tree, in: Analyse Multidimensionnelle des Donnèes, IV Congrès International N’GUS97 CERESTA-CISIA ed.
Cappelli C., Mola F., Siciliano R. (1998) An alternative pruning method based on the impurity-complexity measure, in: Proceedings in Computational Statistics, Cappelli C., Mola F., Siciliano R eds, 221–226, Physica-Verlag.
Harrison D., Rubinfeld D.L. (1978) Hedonic prices and the demand for clean air, Journal of Environmental Economics and Management, 5, 81–102.
Lanubile A., Malerba D. (1998) Induction of Regression Trees with Regtree, Classification and Data Analysis: Book of Short Papers, 253–256, Meeting of the Italian Group of Classification, Pescara.
Mingers J. (1987) Expert System- Rule Induction with Statistical Data, Journal of the Operational Research Society, 38, 39–47.
Mingers J. (1989) An Empirical Comparison of Pruning Methods for Decision Tree Induction, Machine Learning, 4, 227–243.
Morgan, J.N., Sonquist, J.A. (1963). Problems in the analysis of survey data and a proposal, Journal of American Statistical Association, 58, 415–434.
Siciliano, R., Mola F. (1996) A Fast Regression Tree Procedure, in: Statistical Modelling: Proceedings of the 11th International Workshop on Statistical Modelling, A. Forcina et al. eds, 332–340, Perugia: Graphos.
Siciliano, R. (1998) Exploratory versus Decison Trees, in Proceedings in Computational Statistics, invited paper, R. Payne and P. Green eds, 113–124, Physica-Verlag.
Zhang H., Singer B. (1999). Recursive Partitioning in the Health Sciences, New York: Springer Verlag.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cappelli, C., Mola, F., Siciliano, R. (2001). Selecting Regression Tree Models: a Statistical Testing Procedure1 . In: Borra, S., Rocci, R., Vichi, M., Schader, M. (eds) Advances in Classification and Data Analysis. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-59471-7_31
Download citation
DOI: https://doi.org/10.1007/978-3-642-59471-7_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-41488-9
Online ISBN: 978-3-642-59471-7
eBook Packages: Springer Book Archive