Abstract
At present regression trees tend to be accurate, however they can be incomprehensible to experts. The proposed algorithm Economic Generalized Regression (EGR) induces regression trees that are more logical and convenient. EGR uses domain knowledge. The domain knowledge contains “is-a” hierarchies and cost associated to each variable. After generating several subtrees from training examples, EGR selects the best one according to a user-defined balance between accuracy and average classification cost. The user can define the degree of economy and generalization. This information will influence directly on the quality of search that the algorithm must undertake.
This work has been partially supported by project FACA number PB98–0937–C04–01 of the CICYT, Spain. FACA is apart of the FRESCO project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Breiman, L., Friedman, J.H., Olsen, R.A., and Stone, C.J., (1984). Classification and Regression Trees, Wadsworth Int. Group, Belmont, California.
Karalic, A. (1992). Employing linear regression in regression tree leaves.Proceedings of European Congress on Artificial Intelligence, Vienna, Austria.
Karalic A, (1997). First Order Regression. Machine Learning, Vol. 6, 26 (2/3):147–176, Kluwer Academic Publishers.
Le Blanc, M. and Tirshirani, R. (1998).Monotone Shrinkage of Trees. Journal of Computational and Graphics Statistics, Volume 7, Number 4.
Morimoto, Y., Ishii, H. and Morishita, S. (1997). Efficient Construction of Regression Trees with Range and Region Splitting. VLDB 1997.
Núñez, M. (2000). Learing Patterns of Behavior by Observing System Events. To appear in Proceedings of European Conference of Machine Learning.
Núñez, M. (1991). The Use of Background Knowledge in Decision Tree Induction.Machine Learning, Vol. 6, Number 3, Kluwer Academic Publishers.
Núñez, M. (1988). Economic Induction: A Case Study,Proceeding of Third European Working Session on Learning, Pitman Publishing, London.
Tan, M. (1991). Learning a cost-sensitive internal representation for reinforcement learning. In Proceedings of IWML-1991. Morgan Kaufmann.
Tanner, W.F. (1962). Component of the Hypsometric Curve of the Earth. J. Geophis. Res.
Titterington, D.M., Smith, A.F.M. and Makov, U.E. (1987).Statistical Analysis of Finite Mixture Distributions, John Wiley & Sons.
Tumey, P.D. (1995). Cost-sensitive classification: Empirical evaluation of a hybrid genetic decision tree induction algorithm, Journal AI Research, 2 369.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Núñez, M. (2000). Generalized regression trees1 . In: Bethlehem, J.G., van der Heijden, P.G.M. (eds) COMPSTAT. Physica, Heidelberg. https://doi.org/10.1007/978-3-642-57678-2_48
Download citation
DOI: https://doi.org/10.1007/978-3-642-57678-2_48
Publisher Name: Physica, Heidelberg
Print ISBN: 978-3-7908-1326-5
Online ISBN: 978-3-642-57678-2
eBook Packages: Springer Book Archive