Abstract
When using mixture models, researchers may investigate the associations between cluster membership and covariates by introducing these variables in a (logistic) regression model for the prior class membership probabilities. However, a very popular alternative among applied researchers is a three-step approach in which after estimating the mixture model (step 1) and assigning subjects to clusters (step 2), the cluster assignments are regressed on covariates (step 3). For mixture models for categorical responses, (Bolck et al., Political Anal 12:3–27, 2004) and (Vermunt, Political Anal 18:450–469, 2010) showed this approach may severely downward bias covariate effects, and moreover showed how to adjust for this bias. This paper generalizes their corrections methods to be applicable also with mixture models for continuous responses, where the main complicating factor is that a complex multidimensional integral needs to be solved to obtain the classification errors needed for the corrections. We propose approximating this integral by a summation over the empirical distribution of the response variables. The simulation study showed that the approaches work well, except for the combination of very badly separated components and a small sample size.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bandeen-Roche, K., Miglioretti, D. L., Zeger, S. L., & Rathouz, P. J. (1997). Latent variable regression for multiple discrete outcomes. Journal of the American Statistical Association, 92, 1375–1386.
Bolck, A., Croon, M. A., & Hagenaars, J. A. (2004). Estimating latent structural models with categorical variables: one – step versus three-step estimators. Political Analysis, 12, 3–27.
Dayton, C. M., & Macready, G. B. (1988). Concomitant-variable latent class models. Journal of the American Statistical Association, 83, 173–178.
McLachlan, G., & Peel, D. (2000). Finite mixture models. New York: John Wiley.
Vermunt, J. K.(2010). Latent class modeling with covariates: two improved three-step approaches. Political Analysis, 18, 450–469.
Vermunt, J. K., & Magidson, J. (2005). Latent GOLD 4.0 user’s guide. Belmont: Statistical Innovations Inc.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer International Publishing Switzerland
About this paper
Cite this paper
Gudicha, D.W., Vermunt, J.K. (2013). Mixture Model Clustering with Covariates Using Adjusted Three-Step Approaches. In: Lausen, B., Van den Poel, D., Ultsch, A. (eds) Algorithms from and for Nature and Life. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-00035-0_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-00035-0_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-00034-3
Online ISBN: 978-3-319-00035-0
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)