Scale-constrained approaches for maximum likelihood estimation and model selection of clusterwise linear regression models
We consider an equivariant approach that imposes data-driven bounds on the component variances to avoid singular and spurious solutions in maximum likelihood estimation of clusterwise linear regression models. We investigate its use in choosing the number of components, and we propose a computational shortcut that significantly reduces the time needed to tune the bounds on the data. In a simulation study and two real-data applications, we show that the proposed methods provide a reliable assessment of the number of components, compared with standard unconstrained methods, together with accurate estimation of the model parameters and accurate cluster recovery.
Keywords: Clusterwise linear regression; Mixtures of linear regression models; Data-driven constraints; Equivariant estimators; Computationally efficient approach; Model selection
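To make the idea of bounding the variances concrete, the following is a minimal sketch, not the authors' implementation. It runs an EM algorithm for a mixture of linear regressions and clips each component variance to an interval around a target variance. The function name `fit_cwlr`, the tuning constant `c`, the form of the interval (target times sqrt(c) up to target divided by sqrt(c)), and the choice of the pooled OLS residual variance as target are all assumptions made for illustration. In particular, `c` is fixed here, whereas the abstract describes tuning the bounds on the data.

```python
import numpy as np

def fit_cwlr(X, y, G, c=0.1, target_var=None, n_iter=100, seed=0):
    """EM for a G-component mixture of linear regressions with bounded variances.

    Hypothetical sketch: variances are kept in
    [target_var * sqrt(c), target_var / sqrt(c)], with 0 < c <= 1 (assumed form).
    """
    rng = np.random.default_rng(seed)
    n = len(y)
    Xd = np.column_stack([np.ones(n), X])  # design matrix with intercept
    if target_var is None:
        # Assumed target: residual variance of a single pooled OLS fit.
        beta0, *_ = np.linalg.lstsq(Xd, y, rcond=None)
        target_var = np.mean((y - Xd @ beta0) ** 2)
    lo, hi = target_var * np.sqrt(c), target_var / np.sqrt(c)

    post = rng.dirichlet(np.ones(G), size=n)  # random soft assignments
    betas = np.empty((G, Xd.shape[1]))
    sig2 = np.empty(G)
    for _ in range(n_iter):
        # M-step: mixing weights, weighted least squares, clipped variances.
        pi = post.mean(axis=0)
        for g in range(G):
            w = post[:, g]
            sw = np.sqrt(w)
            betas[g], *_ = np.linalg.lstsq(Xd * sw[:, None], y * sw, rcond=None)
            res = y - Xd @ betas[g]
            raw = np.sum(w * res ** 2) / (np.sum(w) + 1e-12)
            sig2[g] = np.clip(raw, lo, hi)  # keeps the variance away from 0
        # E-step: posterior membership probabilities, computed on the log scale.
        res = y[:, None] - Xd @ betas.T
        logd = (np.log(np.maximum(pi, 1e-12))
                - 0.5 * np.log(2 * np.pi * sig2)
                - 0.5 * res ** 2 / sig2)
        logd -= logd.max(axis=1, keepdims=True)
        post = np.exp(logd)
        post /= post.sum(axis=1, keepdims=True)
    return pi, betas, sig2, post
```

Because every variance stays inside the same interval, the ratio between the largest and smallest component variance cannot exceed 1/c in this sketch, which is what rules out degenerate solutions where one variance shrinks to zero. Setting `c = 1` forces all components to share the target variance.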