Scale-constrained approaches for maximum likelihood estimation and model selection of clusterwise linear regression models

Di Mari, Roberto; Rocci, Roberto; Gattone, Stefano Antonio

doi:10.1007/s10260-019-00480-y

Original Paper
Published: 25 June 2019

Statistical Methods & Applications, volume 29, pages 49–78 (2020)

Roberto Di Mari¹ (ORCID: orcid.org/0000-0001-5498-009X), Roberto Rocci² & Stefano Antonio Gattone³

Abstract

We consider an equivariant approach that imposes data-driven bounds on the component variances to avoid singular and spurious solutions in maximum likelihood estimation of clusterwise linear regression models. We investigate its use in choosing the number of components, and we propose a computational shortcut that significantly reduces the time needed to tune the bounds on the data. In a simulation study and two real-data applications, we show that the proposed methods assess the number of components more reliably than standard unconstrained methods, while also delivering accurate parameter estimates and cluster recovery.
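To make the idea of bounded variances concrete, the following is a minimal sketch, not the authors' implementation (their programs are available only on request). It runs an EM algorithm for a G-component clusterwise linear regression and, in each M-step, clips every component variance to an interval around a target variance. The interval form sqrt(c) * target_var <= sigma2_g <= target_var / sqrt(c), with 0 < c <= 1, is an assumption made for illustration, as are the function name `constrained_cwr_em` and all of its parameters. Tuning c on the data (for example by cross-validation) is not shown.

```python
import numpy as np

def constrained_cwr_em(X, y, G, target_var, c, n_iter=200, seed=0):
    """EM for clusterwise linear regression with bounded component variances.

    Assumed (illustrative) bounds:
        sqrt(c) * target_var <= sigma2_g <= target_var / sqrt(c),  0 < c <= 1.
    """
    rng = np.random.default_rng(seed)
    n, p = X.shape
    lo, hi = np.sqrt(c) * target_var, target_var / np.sqrt(c)

    # Random soft assignment of observations to components.
    post = rng.dirichlet(np.ones(G), size=n)
    beta = np.zeros((G, p))
    sigma2 = np.full(G, float(target_var))

    for _ in range(n_iter):
        # M-step: weighted least squares per component, then clip the variance.
        for g in range(G):
            w = post[:, g]
            Xw = X * w[:, None]
            # Tiny ridge term only guards against a numerically singular system.
            beta[g] = np.linalg.solve(Xw.T @ X + 1e-10 * np.eye(p), Xw.T @ y)
            resid = y - X @ beta[g]
            raw_var = (w @ resid**2) / max(w.sum(), 1e-12)
            sigma2[g] = np.clip(raw_var, lo, hi)
        pi = np.clip(post.mean(axis=0), 1e-12, None)
        pi = pi / pi.sum()

        # E-step: posterior membership probabilities, computed in log space.
        resid = y[:, None] - X @ beta.T                      # shape (n, G)
        log_dens = -0.5 * (np.log(2 * np.pi * sigma2) + resid**2 / sigma2)
        log_joint = log_dens + np.log(pi)
        m = log_joint.max(axis=1, keepdims=True)
        log_norm = m + np.log(np.exp(log_joint - m).sum(axis=1, keepdims=True))
        post = np.exp(log_joint - log_norm)

    # Log-likelihood at the parameters of the final M-step.
    loglik = float(log_norm.sum())
    return beta, sigma2, pi, post, loglik
```

Because the variances can never fall below the lower bound, the likelihood stays bounded and a component cannot collapse onto a handful of perfectly fitted points. With c = 1 the sketch reduces to a homoscedastic model with variance target_var, and as c approaches 0 it approaches the unconstrained fit.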

Notes

Computer programs are available from the corresponding author upon request.
We checked that this is also the case for \(n>200\). Related figures are available from the corresponding author upon request.

Author information

Authors and Affiliations

Department of Economics and Business, University of Catania, Catania, Italy
Roberto Di Mari
Department of Economics and Finance, University of Rome Tor Vergata, Rome, Italy
Roberto Rocci
Department of Philosophical and Social Sciences, Economics and Quantitative Methods, University G. d’Annunzio, Chieti-Pescara, Italy
Stefano Antonio Gattone

Corresponding author

Correspondence to Roberto Di Mari.

About this article

Cite this article

Di Mari, R., Rocci, R. & Gattone, S.A. Scale-constrained approaches for maximum likelihood estimation and model selection of clusterwise linear regression models. Stat Methods Appl 29, 49–78 (2020). https://doi.org/10.1007/s10260-019-00480-y

Accepted: 15 June 2019
Published: 25 June 2019
Issue Date: March 2020
DOI: https://doi.org/10.1007/s10260-019-00480-y
