Abstract
Three models for linear regression clustering are given, and corresponding methods for classification and parameter estimation are developed and discussed: The mixture model with fixed regressors (ML-estimation), the fixed partition model with fixed regressors (ML-estimation), and the mixture model with random regressors (Fixed Point Clustering). The number of clusters is treated as unknown. The approaches are compared via an application to Fisher’s Iris data. By the way, a broadly ignored feature of these data is discovered.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
BOCK, H.H. (1969): The equivalence of two extremal problems and its application to the iterative classification of multivariate data. Lecture note, Mathematisches Forschungsinstitut Oberwolfach.
BRYANT, P. G. and WILLIAMSON, J. A. (1986): Maximum likelihood and classification: A comparison of three approaches. In: Gaul, W., and Schader, W. (Eds.): Classification as a Tool of Research, Elsevier, Amsterdam, 35–45.
DAVIES, P. L. and GATHER, U. (1993): The identification of multiple outliers. Journal of the American Statistical Association 88, 782–801.
DESARBO, W. S. and CRON, W. L. (1988): A Maximum Likelihood Methodology for Clusterwise Linear Regression. Journal of Classification 5, 249–282.
FISHER, R. A. (1936): The use of multiple measurements in taxonomic problems. Annals of Eugenics 7, 179–184.
HENNIG, C. (1996): Identifiability of Finite Linear Regression Mixtures. Preprint No. 96–6, Institut für Mathematsche Stochastik, Universität Hamburg.
HENNIG, C. (1997a): Fixed Point Clusters and their Relation to Stochastic Models. In: Klar, R., and Opitz, O. (Eds.): Classification and Knowledge Organization, Springer, Berlin, 20–28.
HENNIG, C. (1997b): Datenanalyse mit Modellen für Cluster linearer Regression. Dissertation, Institut für Mathematsche Stochastik, Universität Hamburg.
HENNIG, C. (1998): Clustering and Outlier Identification: Fixed Point Cluster Analysis. In: Rizzi, A., Vichi, M., and Bock, H.-H. (Eds.): Advances in Data Science and Classification, Springer, Berlin, 37–42.
KIEFER, N. M. (1978): Discrete parameter variation: Efficient estimation of a switching regression model. In: Econometrica 46, 427–434.
KRISHNAIAH, P. R. and MIAO, B. Q. (1988): Review about estimation of change points. In: Krishnaiah, P. R., and Rao, P. C. (Eds.): Handbook of Statistics, Vol. 7, Elsevier, Amsterdam, 375–402.
QUANDT, R. E. and RAMSEY, J. B. (1978): Estimating mixtures of Normal distributions and switching regressions. Journal of the American Statistical Association 73, 730–752.
SPAETH, H. (1979): Clusterwise linear regression. Computing 22, 367–373.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin · Heidelberg
About this paper
Cite this paper
Hennig, C. (1999). Models and Methods for Clusterwise Linear Regression. In: Gaul, W., Locarek-Junge, H. (eds) Classification in the Information Age. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-60187-3_17
Download citation
DOI: https://doi.org/10.1007/978-3-642-60187-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65855-9
Online ISBN: 978-3-642-60187-3
eBook Packages: Springer Book Archive