Bayesian Methods and Model Selection for Latent Growth Curve Models with Missing Data

Lu, Zhenqiu (Laura); Zhang, Zhiyong; Cohen, Allan

doi:10.1007/978-1-4614-9348-8_18

Conference paper
First Online: 01 January 2014

Part of the book series: Springer Proceedings in Mathematics & Statistics (PROMS, volume 66)

Abstract

As latent growth curve models (LGCMs) grow more complex, so do the problems of estimating them. This research first proposes new growth models to address a perennial problem of almost all longitudinal research, namely missing data. Different non-ignorable missingness models are formulated. These include latent coefficient (intercept or slope)-dependent missingness, in which missing data rates vary across individuals' latent initial levels or slopes, and potential outcome-dependent missingness, in which the missing data rate on each occasion depends on the potential outcomes. Second, this study proposes a fully Bayesian approach to estimating the proposed LGCMs with non-ignorable missing data, using a data augmentation algorithm and a Gibbs sampling procedure. Third, model selection criteria are proposed in a Bayesian context to identify the best-fitting model. Simulation studies were conducted. They show that the proposed method can accurately recover model parameters, that mis-specified missingness may result in severely misleading conclusions, and that almost all the model selection criteria can correctly identify the true model with high certainty. The application of the model and the method is illustrated with a longitudinal data set showing growth in mathematical ability. Finally, implications of the approach and future research directions are discussed.
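
To make the idea of latent intercept-dependent missingness concrete, the following is a minimal simulation sketch, not the authors' specification. It assumes a linear growth curve, a logistic link for the missingness probability, and illustrative parameter values; the function name `simulate_lgcm_missing` and the parameters `gamma0` and `gamma1` are hypothetical and do not come from the chapter.

```python
import numpy as np

def simulate_lgcm_missing(n=2000, n_times=4, gamma0=-1.0, gamma1=1.0, seed=0):
    """Simulate linear growth data with latent-intercept-dependent missingness.

    All parameter values are illustrative assumptions, not taken from the paper.
    """
    rng = np.random.default_rng(seed)
    time = np.arange(n_times)                     # time scores 0, 1, ..., T-1
    intercept = rng.normal(5.0, 1.0, size=n)      # latent initial level
    slope = rng.normal(2.0, 0.5, size=n)          # latent rate of change
    noise = rng.normal(0.0, 1.0, size=(n, n_times))
    y = intercept[:, None] + slope[:, None] * time[None, :] + noise

    # Assumed logistic missingness model: the probability that a score is
    # missing depends on the (unobserved) latent intercept, centred at its
    # mean, so the missingness cannot be ignored when estimating the model.
    logit = gamma0 + gamma1 * (intercept - 5.0)
    p_miss = 1.0 / (1.0 + np.exp(-logit))
    missing = rng.random((n, n_times)) < p_miss[:, None]

    y_obs = np.where(missing, np.nan, y)          # mask the missing scores
    return y_obs, missing, intercept

y_obs, missing, intercept = simulate_lgcm_missing()
# Compare missing data rates for individuals above and below the mean level.
print(missing[intercept > 5.0].mean(), missing[intercept <= 5.0].mean())
```

In this sketch individuals with a higher latent initial level lose more observations, which is the kind of pattern an ignorable-missingness analysis would not account for.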

Author information

Authors and Affiliations

University of Georgia, Athens, GA, 30602, USA
Zhenqiu (Laura) Lu & Allan Cohen
University of Notre Dame, Notre Dame, IN, 46556, USA
Zhiyong Zhang

Corresponding author

Correspondence to Zhenqiu (Laura) Lu.

Editor information

Editors and Affiliations

Department of Psychology, Arizona State University, Tempe, AZ, USA
Roger E. Millsap
Department of Methodology and Statistics, Tilburg University, Tilburg, The Netherlands
L. Andries van der Ark
Department of Educational Psychology, University of Wisconsin, Madison, WI, USA
Daniel M. Bolt
Department of Psychology, University of Kansas, Lawrence, KS, USA
Carol M. Woods

About this paper

Cite this paper

Lu, Z. (L.), Zhang, Z., Cohen, A. (2013). Bayesian Methods and Model Selection for Latent Growth Curve Models with Missing Data. In: Millsap, R.E., van der Ark, L.A., Bolt, D.M., Woods, C.M. (eds) New Developments in Quantitative Psychology. Springer Proceedings in Mathematics & Statistics, vol 66. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-9348-8_18

DOI: https://doi.org/10.1007/978-1-4614-9348-8_18
Published: 13 January 2014
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-9347-1
Online ISBN: 978-1-4614-9348-8
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
