Skip to main content

Mixture Model Clustering with Covariates Using Adjusted Three-Step Approaches

  • Conference paper
  • First Online:
Algorithms from and for Nature and Life

Abstract

When using mixture models, researchers may investigate the associations between cluster membership and covariates by introducing these variables in a (logistic) regression model for the prior class membership probabilities. However, a very popular alternative among applied researchers is a three-step approach in which after estimating the mixture model (step 1) and assigning subjects to clusters (step 2), the cluster assignments are regressed on covariates (step 3). For mixture models for categorical responses, (Bolck et al., Political Anal 12:3–27, 2004) and (Vermunt, Political Anal 18:450–469, 2010) showed this approach may severely downward bias covariate effects, and moreover showed how to adjust for this bias. This paper generalizes their corrections methods to be applicable also with mixture models for continuous responses, where the main complicating factor is that a complex multidimensional integral needs to be solved to obtain the classification errors needed for the corrections. We propose approximating this integral by a summation over the empirical distribution of the response variables. The simulation study showed that the approaches work well, except for the combination of very badly separated components and a small sample size.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  • Bandeen-Roche, K., Miglioretti, D. L., Zeger, S. L., & Rathouz, P. J. (1997). Latent variable regression for multiple discrete outcomes. Journal of the American Statistical Association, 92, 1375–1386.

    Article  MathSciNet  MATH  Google Scholar 

  • Bolck, A., Croon, M. A., & Hagenaars, J. A. (2004). Estimating latent structural models with categorical variables: one – step versus three-step estimators. Political Analysis, 12, 3–27.

    Article  Google Scholar 

  • Dayton, C. M., & Macready, G. B. (1988). Concomitant-variable latent class models. Journal of the American Statistical Association, 83, 173–178.

    Article  MathSciNet  Google Scholar 

  • McLachlan, G., & Peel, D. (2000). Finite mixture models. New York: John Wiley.

    Book  MATH  Google Scholar 

  • Vermunt, J. K.(2010). Latent class modeling with covariates: two improved three-step approaches. Political Analysis, 18, 450–469.

    Article  Google Scholar 

  • Vermunt, J. K., & Magidson, J. (2005). Latent GOLD 4.0 user’s guide. Belmont: Statistical Innovations Inc.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Dereje W. Gudicha .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer International Publishing Switzerland

About this paper

Cite this paper

Gudicha, D.W., Vermunt, J.K. (2013). Mixture Model Clustering with Covariates Using Adjusted Three-Step Approaches. In: Lausen, B., Van den Poel, D., Ultsch, A. (eds) Algorithms from and for Nature and Life. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Cham. https://doi.org/10.1007/978-3-319-00035-0_8

Download citation

Publish with us

Policies and ethics