Abstract
Before applying an automated model selection procedure, one must first choose the class (or family) of models from which the model will be selected. If no prior knowledge about the data indicates the best class of models, this choice is difficult at best. In this chapter, we present an approach to automating this step in classifier construction. In addition to searching for the best model, our approach searches for the best model class, using a heuristic search strategy that finds the best model class for each recursive call of a divide-and-conquer tree induction algorithm. The end result is a hybrid tree-structured classifier, which allows different subspaces of a data set to be fit by models from different model classes. During the search for the best model, the method considers whether and why a model class is a poor choice, and selects a better class on that basis. We describe an implementation of the approach, the MCS system, and present experimental results illustrating the system’s ability to identify the best model (and model class) efficiently.
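The idea of choosing a model class anew at each recursive call can be illustrated with a minimal Python sketch. It is not the MCS system: the two model classes (`univariate` and `linear_sum`), the training-error criterion, and every function name here are assumptions made for illustration, and the sketch replaces the heuristic, diagnosis-driven search described above with a plain comparison of all candidate classes at each node.

```python
from collections import Counter

# Hypothetical model classes. Each returns (projection, description) pairs;
# a node's test has the form "projection(row) <= threshold".
def univariate(n):
    return [((lambda r, j=j: r[j]), f"x[{j}]") for j in range(n)]

def linear_sum(n):
    return [((lambda r: sum(r)), "sum(x)")]

MODEL_CLASSES = {"univariate": univariate, "linear-sum": linear_sum}

def errors(labels):
    """Training errors made by predicting the majority label."""
    return len(labels) - Counter(labels).most_common(1)[0][1]

def best_test(X, y, projections):
    """Best threshold test within one model class, or None if no split exists."""
    best = None
    for proj, desc in projections:
        values = [proj(r) for r in X]
        # Skip the largest value: it would leave the right side empty.
        for t in sorted(set(values))[:-1]:
            left = [lab for v, lab in zip(values, y) if v <= t]
            right = [lab for v, lab in zip(values, y) if v > t]
            e = errors(left) + errors(right)
            if best is None or e < best[0]:
                best = (e, proj, t, desc)
    return best

def build(X, y):
    """Divide and conquer, picking a model class separately at every node."""
    if len(set(y)) == 1:
        return {"leaf": y[0]}
    chosen = None
    for name, make in MODEL_CLASSES.items():
        cand = best_test(X, y, make(len(X[0])))
        # Strict "<" means ties go to the class listed first.
        if cand is not None and (chosen is None or cand[0] < chosen[1][0]):
            chosen = (name, cand)
    if chosen is None:  # no class can split these instances
        return {"leaf": Counter(y).most_common(1)[0][0]}
    name, (_, proj, t, desc) = chosen
    left = [(r, lab) for r, lab in zip(X, y) if proj(r) <= t]
    right = [(r, lab) for r, lab in zip(X, y) if proj(r) > t]
    return {
        "model_class": name, "test": f"{desc} <= {t}",
        "proj": proj, "threshold": t,
        "left": build(*map(list, zip(*left))),
        "right": build(*map(list, zip(*right))),
    }

def predict(tree, row):
    while "leaf" not in tree:
        go_left = tree["proj"](row) <= tree["threshold"]
        tree = tree["left"] if go_left else tree["right"]
    return tree["leaf"]

# Toy data (invented): class A has a small coordinate sum; B and C differ in x[0].
X = [(0, 0), (1, 0), (0, 1), (1, 1), (0, 2), (2, 0),
     (3, 0), (3, 3), (2, 1), (2, 3),
     (0, 3), (1, 3), (1, 2)]
y = ["A"] * 6 + ["B"] * 4 + ["C"] * 3
tree = build(X, y)
print(tree["model_class"], "|", tree["test"])
print(tree["right"]["model_class"], "|", tree["right"]["test"])
```

On this toy data the root is fit by the `linear-sum` class and its right subtree by the `univariate` class, which is the sense in which a hybrid tree lets different subspaces be fit by models from different classes.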
Copyright information
© 1994 Springer-Verlag New York, Inc.
About this paper
Cite this paper
Brodley, C.E., Utgoff, P.E. (1994). Dynamic Recursive Model Class Selection for Classifier Construction. In: Cheeseman, P., Oldford, R.W. (eds) Selecting Models from Data. Lecture Notes in Statistics, vol 89. Springer, New York, NY. https://doi.org/10.1007/978-1-4612-2660-4_34
DOI: https://doi.org/10.1007/978-1-4612-2660-4_34
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-94281-0
Online ISBN: 978-1-4612-2660-4
eBook Packages: Springer Book Archive