Sankhya B

pp 1–27 | Cite as

A Copula-Based GLMM Model for Multivariate Longitudinal Data with Mixed-Types of Responses

  • Weiping Zhang
  • MengMeng Zhang
  • Yu ChenEmail author


We propose a copula-based generalized linear mixed model (GLMM) to jointly analyze multivariate longitudinal data with mixed types, including continuous, count and binary responses. The association of repeated measurements is modelled through the GLMM model, meanwhile a pair-copula construction (D-vine) is adopted to measure the dependency structure between different responses. By combining mixed models and D-vine copulas, our proposed approach could not only deal with unbalanced data with arbitrary margins but also handle moderate dimensional problems due to the efficiency and flexibility of D-vines. Based on D-vine copulas, algorithms for sampling mixed data and computing likelihood are also developed. Leaving the random effects distribution unspecified, we use nonparametric maximum likelihood for model fitting. Then an E-M algorithm is used to obtain the maximum likelihood estimates of parameters. Both simulations and real data analysis show that the nonparametric models are more efficient and flexible than the parametric models.

Keywords and phrases

Longitudinal data Mixed types Joint estimate D-vine copula Nonparametric maximum likelihood E-M algorithm 

AMS (2000) subject classification

Primary 62G05 Secondary 62J12 



We thank the Associate Editor and two referees for their constructive comments and suggestions that have greatly improved the paper. Zhang and Chen acknowledges support from the National Key Research and Development Plan under Grant 2016YFC0800100; the NSFC of China under Grant 11671374, 71771203 and 71631006.


  1. Aas, K., Czado, C., Frigessi, A. and Bakken, H. (2009). Pair-copula constructions of multiple dependence. Insurance: Mathematics and economics 44, 2, 182–198.MathSciNetzbMATHGoogle Scholar
  2. Aitkin, M. (1999). A general maximum likelihood analysis of variance components in generalized linear models. Biometrics 55, 1, 117–128.MathSciNetzbMATHGoogle Scholar
  3. Bandyopadhyay, S., Ganguli, B. and Chatterjee, A. (2011). A review of multivariate longitudinal data analysis. Statistical methods in medical research 20, 4, 299–330.MathSciNetzbMATHGoogle Scholar
  4. Bedford, T. and Cooke, R. M. (2002). Vines: a new graphical model for dependent random variables. Annals of Statistics 30, 4, 1031–1068.MathSciNetzbMATHGoogle Scholar
  5. Butler, S. M. and Louis, T. A. (1992). Random effects models with non-parametric priors. Statistics in Medicine 11, 14-15, 1981–2000.Google Scholar
  6. Chen, Y., Fei, Y. and Pan, J. (2015). Quasi-Monte Carlo estimation in generalized linear mixed model with correlated random effects. Open Access Library Journal 2, 10, 1.Google Scholar
  7. Cho, H. (2016). The analysis of multivariate longitudinal data using multivariate marginal models. Journal of Multivariate Analysis 143, 481–491.MathSciNetzbMATHGoogle Scholar
  8. Dißmann, J., Brechmann, E. C., Czado, C. and Kurowicka, D.s (2013). Selecting and estimating regular vine copulae and application to financial returns. Computational Statistics and Data Analysis 59, 52–69.MathSciNetzbMATHGoogle Scholar
  9. Feddag, M. L., Grama, I. and Mesbah, M. (2003). Generalized estimating equations (GEE) for mixed logistic models. Communications in Statistics-Theory and methods 32, 4, 851–874.MathSciNetzbMATHGoogle Scholar
  10. Fieuws, S., Verbeke, G., Maes, B. and Vanrenterghem, Y. (2007). Predicting renal graft failure using multivariate longitudinal profiles. Biostatistics 9, 3, 419–431.zbMATHGoogle Scholar
  11. Fleming, T. R. and Harrington, D. P. (1991). Counting Processes and Survival Analysis. Wiley, New York.zbMATHGoogle Scholar
  12. Gallant, A. R. and Nychka, D. W. (1987). Semi-nonparametric maximum likelihood estimation. Econometrica: Journal of the Econometric Society, 363–390.Google Scholar
  13. He, J., Li, H., Edmondson, A. C., Rader, D. J. and Li, M. (2012). A Gaussian copula approach for the analysis of secondary phenotypes in case–control genetic association studies. Biostatistics 13, 3, 497–508.zbMATHGoogle Scholar
  14. Jaffa, M. A., Gebregziabher, M., Luttrell, D. K., Luttrell, L. M. and Jaffa, A. A. (2016). Multivariate generalized linear mixed models with random intercepts to analyze cardiovascular risk markers in type-1 diabetic patients. Journal of Applied Statistics 43, 8, 1447–1464.MathSciNetGoogle Scholar
  15. Joe, H. (1996). Families of m-variate distributions with given margins and m (m-1)/2 bivariate dependence parameters. Lecture Notes-Monograph Series, 120–141.Google Scholar
  16. Joe, H. (1997). Multivariate Models and Multivariate Dependence Concepts. Chapman & Hall, London.zbMATHGoogle Scholar
  17. Killiches, M. and Czado, C. (2017). A D-vine copula based model for repeated measurements extending linear mixed models with homogeneous correlation structure. arXiv: 1705.06261.
  18. Kim, D., Kim, J. M., Liao, S. M. and Jung, Y. S. (2013). Mixture of D-vine copulas for modeling dependence. Computational Statistics & Data Analysis 64, 1–19.MathSciNetzbMATHGoogle Scholar
  19. Laird, N. M. and Ware, J. H. (1982). Random-effects models for longitudinal data. Biometrics, 963–974.Google Scholar
  20. Lambert, P. and Vandenhende, F. (2002). A copula-based model for multivariate non-normal longitudinal data: analysis of a dose titration safety study on a new antidepressant. Statistics in Medicine 21, 21, 3197–3217.Google Scholar
  21. Liang, K. Y. and Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika 73, 1, 13–22.MathSciNetzbMATHGoogle Scholar
  22. Louis, T. A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society. Series B (Methodological), 226–233.Google Scholar
  23. Min, Y. and Agresti, A. (2005). Random effect models for repeated measures of zero-inflated count data. Statistical Modelling 5, 1, 1–19.MathSciNetzbMATHGoogle Scholar
  24. Nelsen, R. (2006). An Introduction to Copulas, 2nd edn. Springer, New York.zbMATHGoogle Scholar
  25. Panagiotelis, A., Czado, C. and Joe, H. (2012). Pair copula constructions for multivariate discrete data. Journal of the American Statistical Association 107, 499, 1063–1072.MathSciNetzbMATHGoogle Scholar
  26. Pinheiro, J. C. and Bates, D. M. (1995). Approximations to the log-likelihood function in the nonlinear mixed-effects model. Journal of computational and Graphical Statistics 4, 1, 12–35.Google Scholar
  27. Rochon, J. (1996). Analyzing bivariate repeated measures for discrete and continuous outcome variables. Biometrics 52, 740–750.zbMATHGoogle Scholar
  28. Sklar, A. (1959). Fonctions de répartition à n dimensions et leurs marges. Publications of the Institute of Statistics, University of Paris, 8, 229–231.Google Scholar
  29. Smith, M., Min, A., Almeida, C. and Czado, C. (2010). Modeling longitudinal data using a pair-copula decomposition of serial dependence. Journal of the American Statistical Association 105, 492, 1467–1479.MathSciNetzbMATHGoogle Scholar
  30. Song, P. X. K., Li, M. and Yuan, Y. (2009). Joint regression analysis of correlated data using Gaussian copulas. Biometrics 65, 1, 60–68.MathSciNetzbMATHGoogle Scholar
  31. Sun, J., Frees, E. W. and Rosenberg, M. A. (2008). Heavy-tailed longitudinal data modeling using copulas. Insurance: Mathematics and Economics 42, 2, 817–830.zbMATHGoogle Scholar
  32. Verbeke, G. and Lesaffre, E. (1997). The effect of misspecifying the random-effects distribution in linear mixed models for longitudinal data. Computational Statistics & Data Analysis 23, 4, 541–556.MathSciNetzbMATHGoogle Scholar
  33. Verbeke, G., Fieuws, S., Molenberghs, G. and Davidian, M. (2014). The analysis of multivariate longitudinal data: a review. Statistical methods in medical research 23, 1, 42–59.MathSciNetGoogle Scholar
  34. Wang, Y. G. and Carey, V. (2003). Working correlation structure misspecification, estimation and covariate design: implications for generalised estimating equations performance. Biometrika 90, 1, 29–41.MathSciNetzbMATHGoogle Scholar
  35. Zeger, S. L. and Liang, K. Y. (1991). Feedback models for discrete and continuous time series. Statistica Sinica, 51–64.Google Scholar
  36. Zhang, D. and Davidian, M. (2001). Linear mixed models with flexible distributions of random effects for longitudinal data. Biometrics 57, 3, 795–802.MathSciNetzbMATHGoogle Scholar
  37. Zilko, A. A. and Kurowicka, D. (2016). Copula in a multivariate mixed discrete–continuous model. Computational Statistics and Data Analysis 103, 28–55.MathSciNetzbMATHGoogle Scholar

Copyright information

© Indian Statistical Institute 2019

Authors and Affiliations

  1. 1.Department of Statistics and FinanceUniversity of Science and Technology of ChinaHefeiChina

Personalised recommendations