An Introduction to (Generalized (Non)Linear Mixed Models

  • Geert Molenberghs
  • Geert Verbeke
Part of the Statistics for Social Science and Public Policy book series (SSBS)


In applied sciences, one is often confronted with the collection of correlated data or otherwise hierarchical data. This generic term embraces a multitude of data structures, such as multivariate observations, clustered data, repeated measurements (called ‘repeated observations’ in this volume), longitudinal data, and spatially correlated data. In particular, studies are often designed to investigate changes in a specific parameter which is measured repeatedly over time in the participating persons. This is in contrast to cross-sectional studies where the response of interest is measured only once for each individual. Longitudinal studies are conceived for the investigation of such changes, together with the evolution of relevant covariates.


Linear Mixed Model Generalize Linear Mixed Model American Statistical Association Exponential Family Royal Statistical Society 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. Aerts, M., Geys, H., Molenberghs, G., & Ryan, L.M. (2002). Topics in Modeling of Clustered Binary Data. London: Chapman & Hall.CrossRefGoogle Scholar
  2. Afifi, A., & Elashoff, R. (1966). Missing observations in multivariate statistics I: Review of the literature. Journal of the American Statistical Association, 61, 595–604.MathSciNetGoogle Scholar
  3. Agresti, A. (1990) Categorical Data Analysis. New York: John Wiley & Sons.zbMATHGoogle Scholar
  4. Akaike, H. (1974) A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19, 716–723.MathSciNetzbMATHCrossRefGoogle Scholar
  5. Altham, P.M.E. (1978). Two generalizations of the binomial distribution. Applied Statistics, 27, 162–167.MathSciNetzbMATHCrossRefGoogle Scholar
  6. Bahadur, R.R. (1961). A representation of the joint distribution of responses of P dichotomous items. In H. Solomon (Ed.), Studies in Item Analysis and Prediction (pp. 158–168). Stanford, CA: Stanford University Press.Google Scholar
  7. Besag, J., Green, P.J., Higdon, D., & Mengersen, K. (1995). Bayesian computation and stochastic systems. Statistical Science, 10, 3–66.MathSciNetzbMATHCrossRefGoogle Scholar
  8. Böhning, D. (1999). Computer-Assisted Analysis of Mixtures and Applications: Meta-analysis, Disease Mapping and Others. London: Chapman & Hall.Google Scholar
  9. Breslow, N.E., & Clayton, D.G. (1993). Approximate inference in generalized linear mixed models. Journal of the American Statistical Association, 88, 9–25.zbMATHGoogle Scholar
  10. Breslow, N.E., & Day, N.E. (1987). Statistical Methods in Cancer Research, Volume II. Oxford: Oxford University Press.Google Scholar
  11. Conaway, M. (1989). Analysis of repeated categorical measurements with conditional likelihood methods. Journal of the American Statistical Association, 84, 53–62.MathSciNetCrossRefGoogle Scholar
  12. Cox, D.R. (1972). The analysis of multivariate binary data. Applied Statistics, 21, 113–120.CrossRefGoogle Scholar
  13. Cressie, N.A.C. (1991). Statistics for Spatial Data. New York: Wiley.zbMATHGoogle Scholar
  14. Dale, J.R. (1986). Global cross-ratio models for bivariate, discrete, ordered responses. Biometrics, 42, 909–917.CrossRefGoogle Scholar
  15. De Backer, M., De Vroey, C., Lesaffre, E., Scheys, I., & De Keyser, P. (1998). Twelve weeks of continuous oral therapy for toenail onychomycosis caused by dermatophytes: A double-blind comparative trial of terbina-fine 250 mg/day versus itraconazole 200 mg/day. Journal of the American Academy of Dermatology, 38, S57–63.CrossRefGoogle Scholar
  16. Declerck, L., Aerts, M., & Molenberghs, G. (1998). Behaviour of the likelihood-ratio test statistic under a Bahadur model for exchangeable binary data. Journal of Statistical Computations and Simulations, 61, 15–38.MathSciNetzbMATHCrossRefGoogle Scholar
  17. Dempster, A.P., Laird, N.M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society, Series B, 39, 1–38.MathSciNetzbMATHGoogle Scholar
  18. Diggle, P.J. (1983). Statistical Analysis of Spatial Point Patterns. London: Academic Press.zbMATHGoogle Scholar
  19. Diggle, P.J. (1990). Time Series: A Bio statistical Introduction. Oxford: Oxford University Press.Google Scholar
  20. Diggle, P.J., Heagerty, P., Liang, K-Y., & Zeger, S.L. (2002). Analysis of Longitudinal Data. New York: Oxford University Press.Google Scholar
  21. Diggle, P.J., & Kenward, M.G. (1994). Informative drop-out in longitudinal data analysis (with discussion). Applied Statistics, 43, 49–93.zbMATHCrossRefGoogle Scholar
  22. Efron, B. (1986). Double exponential families and their use in generalized linear regression. Journal of the American Statistical Association, 81, 709–721.MathSciNetzbMATHCrossRefGoogle Scholar
  23. Fahrmeir, L., & Tutz, G. (2001). Multivariate Statistical Modeling Based on Generalized Linear Models. Heidelberg: Springer.CrossRefGoogle Scholar
  24. Gamerman, D. (1997). Efficient sampling from the posterior distribution in generalized linear mixed models. Statistics and Computing, 7, 57–68.CrossRefGoogle Scholar
  25. Gelman, A., Carlin, J.B., Stern, H.S., & Rubin, D.B. (2004). Bayesian Data Analysis. London: Chapman & Hall.zbMATHGoogle Scholar
  26. Gilmour, A.R., Anderson, R.D., & Rae, A.L. (1985). The analysis of binomial data by a generalized linear mixed model. Biometrika, 72, 593–599.MathSciNetCrossRefGoogle Scholar
  27. Gilula, Z., & Haberman, S. (1994). Conditional log-linear models for analyzing categorical panel data. Journal of the American Statistical Association, 89, 645–656.MathSciNetzbMATHCrossRefGoogle Scholar
  28. Goldstein, H. (1986). Multilevel mixed linear model analysis using iterative generalized least squares. Biometrika, 73, 43–56.MathSciNetzbMATHCrossRefGoogle Scholar
  29. Goldstein, H. (1991). Nonlinear multilevel models, with an application to discrete response data. Biometrika, 78, 45–51.MathSciNetCrossRefGoogle Scholar
  30. Goldstein, H. (1995). Multilevel Statistical Models (2nd ed.). London: Arnold.Google Scholar
  31. Goldstein, H. (2003). Multilevel Statistical Models (3rd ed.). London: Arnold.zbMATHGoogle Scholar
  32. Goldstein, H., & Rasbash, J. (1996). Improved approximations for multilevel models with binary responses. Journal of the Royal Statistical Society A, 159, 505–513.MathSciNetzbMATHCrossRefGoogle Scholar
  33. Hartley, H.O., & Hocking, R. (1971). The analysis of incomplete data. Biometrics, 27, 7783–808.CrossRefGoogle Scholar
  34. Harville, D.A. (1974). Bayesian inference for variance components using only error contrasts. Biometrika, 61, 383–385.MathSciNetzbMATHCrossRefGoogle Scholar
  35. Hobert, J.P., & Casella, G. (1996). The effect of improper priors on Gibbs sampling in hierarchical linear mixed models. Journal of the American Statistical Association, 91, 1461–1473.MathSciNetzbMATHCrossRefGoogle Scholar
  36. Kenward, M.G., & Roger, J.H. (1997). Small sample inference for fixed effects from restricted maximum likelihood. Biometrics, 53, 983–997.zbMATHCrossRefGoogle Scholar
  37. Kleinman, J. (1973). Proportions with extraneous variance: single and independent samples. Journal of the American Statistical Association, 68, 46–54.Google Scholar
  38. Knorr-Held, L. (1997). Hierarchical Modeling of Discrete Longitudinal Data; Applications of Markov Chain Monte Carlo. München: Utz.Google Scholar
  39. Kuk, A.Y.C. (1995). Asymptotically unbiased estimation in generalised linear models with random effects. Journal of the Royal Statistical Society B,57, 395–407.Google Scholar
  40. Kupper, L.L., & Haseman, J.K. (1978). The use of a correlated binomial model for the analysis of certain toxicology experiments. Biometrics, 34, 69–76.zbMATHCrossRefGoogle Scholar
  41. Lang, J.B., & Agresti, A. (1994). Simultaneously modeling joint and marginal distributions of multivariate categorical responses. Journal of the American Statistical Association, 89, 625–632.zbMATHCrossRefGoogle Scholar
  42. Lavergne, C., & Trottier, C. (2000). Sur l’estimation dans les modèles linéaires généralisés à effets aléatoires. Revue de Statistique Appliquée, 48, 49–67.Google Scholar
  43. Lee, Y., & Neider, J.A. (1996). Hierarchical generalized linear models (with discussion). Journal of the Royal Statistical Society, Series B, 58, 619–678.zbMATHGoogle Scholar
  44. Liang, K.-Y., & Zeger, S.L. (1986). Longitudinal data analysis using generalized linear models. Biometrika, 73, 13–22.MathSciNetzbMATHCrossRefGoogle Scholar
  45. Liang, K.Y., Zeger, S.L., & Qaqish, B. (1992). Multivariate regression analyses for categorical data. Journal of the Royal Statistical Society, Series B, 54, 3–40.MathSciNetzbMATHGoogle Scholar
  46. Lin, X., & Breslow, N.E. (1996). Bias correction in generalized linear mixed models with multiple components of dispersion. Journal of the American Statistical Association, 91, 1007–1016.MathSciNetzbMATHCrossRefGoogle Scholar
  47. Lipsitz, S.R., Laird, N.M., & Harrington, D.P. (1991). Generalized estimating equations for correlated binary data: using the odds ratio as a measure of association. Biometrika, 78, 153–160.MathSciNetCrossRefGoogle Scholar
  48. Little, R.J.A., & Rubin, D.B. (1987). Statistical Analysis with Missing Data. New York: Wiley.zbMATHGoogle Scholar
  49. Longford, N.T. (1993). Random Coefficient Models. London: Oxford University Press.zbMATHGoogle Scholar
  50. McCullagh, P., & Neider, J.A. (1989). Generalized Linear Models. London: Chapman & Hall.zbMATHGoogle Scholar
  51. Molenberghs, G., & Lesaffre, E. (1994). Marginal modeling of correlated ordinal data using a multivariate Plackett distribution. Journal of the American Statistical Association, 89, 633–644.zbMATHCrossRefGoogle Scholar
  52. Molenberghs, G., & Lesaffre, E. (1999). Marginal modeling of multivariate categorical data. Statistics in Medicine, 18, 2237–2255.CrossRefGoogle Scholar
  53. Murray, G.D., & Findlay, J.G. (1988). Correcting for the bias caused by drop-outs in hypertension trials. Statististics in Medicine, 7, 941–946.CrossRefGoogle Scholar
  54. Neider, J.A., & Wedderburn, R.W.M. (1972). Generalized linear models. Journal of the Royal Statistical Society, Series B, 135, 370–384.CrossRefGoogle Scholar
  55. Neuhaus, J.M. (1992). Statistical methods for longitudinal and clustered designs with binary responses. Statistical Methods in Medical Research, 1, 249–273.CrossRefGoogle Scholar
  56. Neuhaus, J.M., Kalbfleisch, J.D., & Hauck, W.W. (1991). A comparison of cluster-specific and population-averaged approaches for analyzing correlated binary data. International Statistical Review, 59, 25–35.CrossRefGoogle Scholar
  57. Pinheiro, J.C., & Bates, D.M. (1995). Approximations to the log-likelihood function in the nonlinear mixed-effects model. Journal of Computational and Graphical Statistics, 4, 12–35.Google Scholar
  58. Pinheiro, J.C., & Bates D.M. (2000). Mixed Effects Models in S and S-PLUS. New-York: Springer.zbMATHCrossRefGoogle Scholar
  59. Plackett, R.L. (1965). A class of bivariate distributions. Journal of the American Statistical Association, 60, 516–522.MathSciNetCrossRefGoogle Scholar
  60. Prentice, R.L. (1988). Correlated binary regression with covariates specific to each binary observation. Biometrics, 44, 1033–1048.MathSciNetzbMATHCrossRefGoogle Scholar
  61. Press, W.H., Teukolsky, S.A., Vetterling, W.T., & Flannery, B.P. (1992). Numerical recipes in FORTRAN. (2nd ed.). Cambridge, MA: Cambridge University Press.zbMATHGoogle Scholar
  62. Raftery, A.E. (1995). Bayesian model selection in social research. Sociological Methodology, 25, 111–163.CrossRefGoogle Scholar
  63. Raubertas, R.F., Lee, C.I.C., & Nordheim, E.V. (1986). Hypothesis tests for normal means constrained by linear inequalities. Communications in Statistics-Theory and Methods 15, 2809–2833.MathSciNetzbMATHCrossRefGoogle Scholar
  64. Renard, D., Molenberghs, G., & Geys, H. (2002). A paijwise likelihood approach to estimation in multilevel probit models. Computational Statistics and Data Analysis. Manuscript accepted for publication.Google Scholar
  65. Ripley, B.D. (1981). Spatial Statistics. New York: Wiley.zbMATHCrossRefGoogle Scholar
  66. Ripley, B.D. (1987). Stochastic Simulation. New York: Wiley.zbMATHCrossRefGoogle Scholar
  67. Robins, J.M., Rotnitzky, A., & Zhao, L.P. (1995). Analysis of semiparamet-ric regression models for repeated outcomes in the presence of missing data. Journal of the American Statistical Association, 90, 106–121.MathSciNetzbMATHCrossRefGoogle Scholar
  68. Rodriguez, G., & Goldman, N. (1995). An assessment of estimation procedures for multilevel models with binary responses. Journal of the Royal Statistical Society A, 158, 73–89.CrossRefGoogle Scholar
  69. Rosner, B. (1984). Multivariate methods in opthalmology with applications to other paired-data. Biometrics, 40, 1025–1035.CrossRefGoogle Scholar
  70. Rubin, D.B. (1976). Inference and missing data. Biometrika, 63, 581–592.MathSciNetzbMATHCrossRefGoogle Scholar
  71. Rubin, D.B. (1987). Multiple Imputation for Nonresponse in Surveys. New York: Wiley.CrossRefGoogle Scholar
  72. Rubin, D.B., Stern, H.S., & Vehovar V. (1995). Handling “don’t know” survey responses: the case of the Slovenian plebiscite. Journal of the American Statistical Association, 90, 822–828.Google Scholar
  73. Satterthwaite, F.E. (1941). Synthesis of variance. Psychometrika, 6, 309–316.MathSciNetzbMATHCrossRefGoogle Scholar
  74. Schall, R. (1991). Estimation in generalised linear models with random effects. Biometrika, 78, 719–727.zbMATHCrossRefGoogle Scholar
  75. Schwarz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464.MathSciNetzbMATHCrossRefGoogle Scholar
  76. Shapiro, A. (1988). Towards a unified theory of inequality constrained testing in multivariate analysis. International Statistical Review 56, 49–62.MathSciNetzbMATHCrossRefGoogle Scholar
  77. Silvapulle, M.J., & Silvapulle, P. (1995). A score test against one-sided alternatives. Journal of the American Statistical Association 90, 342–349.MathSciNetzbMATHCrossRefGoogle Scholar
  78. Skellam, J.G. (1948). A probability distribution derived from the binomial distribution by regarding the probability of success as variable between the sets of trials. Journal of the Royal Statistical Society, Series B, 10, 257–261.MathSciNetzbMATHGoogle Scholar
  79. Spiessens, B., Lesaffre, E., Verbeke, G., & Kim, K. (2002). Group sequential methods for an ordinal logistic random-effects model under misspecifica-tion. Biometrics, 58, 569–575.MathSciNetzbMATHCrossRefGoogle Scholar
  80. Spiessens, B., Lesaffre, E., Verbeke, G., & Kim, K. (2003). The use of mixed models for longitudinal count data when the random-effects distribution is misspecified. Manuscript submitted for publication.Google Scholar
  81. Stiratelli, R., Laird, N., & Ware, J.H. (1984). Random-effects model for serial observations with binary response. Biometrics, 40, 961–971.CrossRefGoogle Scholar
  82. Stram, D.O., & Lee, J.W. (1994). Variance components testing in the longitudinal mixed-effects model. Biometrics, 50, 1171–1177.zbMATHCrossRefGoogle Scholar
  83. Stram, D.A., & Lee, J.W. (1995). Correction to: Variance components testing in the longitudinal mixed-effects model. Biometrics, 51, 1196.Google Scholar
  84. Ten Have, T.R., Landis, R., & Weaver, S.L. (1995). Association models for periodontal disease progression: A comparison of methods for clustered binary data. Statistics in Medicine, 14, 413–429.CrossRefGoogle Scholar
  85. Thélot, C. (1985). Lois logistiques à deux dimensions. Annales de l’Insée, 58, 123–149.Google Scholar
  86. Verbeke, G., & Lesaffre, E. (1996). A linear mixed-effects model with heterogeneity in the random-effects population. Journal of the American Statistical Association, 91, 217–221.zbMATHCrossRefGoogle Scholar
  87. Verbeke, G., & Molenberghs, G. (1997). Linear Mixed Models in Practice: A SAS-Oriented Approach. Lecture Notes in Statistics 126. New York: Springer.zbMATHCrossRefGoogle Scholar
  88. Verbeke, G., & Molenberghs, G. (2000). Linear Mixed Models for Longitudinal Data. New York: Springer.zbMATHGoogle Scholar
  89. Verbeke, G., & Molenberghs, G. (2003). The use of score tests for inference on variance components. Biometrics, 59, 254–262.MathSciNetzbMATHCrossRefGoogle Scholar
  90. Wedderburn, R.W.M. (1974). Quasi-likelihood functions, generalized linear models, and the Gauss-Newton method. Biometrika, 61, 439–447.MathSciNetzbMATHGoogle Scholar
  91. Wolfinger, R. (1998). Towards practical application of generalized linear mixed models. In B. Marx and H. Friedl (Eds), Proceedings of the 13th International Workshop on Statistical Modeling (pp. 388–395). New Orleans, Louisiana.Google Scholar
  92. Wolfinger, R., & O’Connell, M. (1993). Generalized linear mixed models: A pseudo-likelihood approach. Journal of Statistical Computing and Simulation, 48, 233–243.zbMATHCrossRefGoogle Scholar
  93. Zeger, S.L., & Karim, M.R. (1991). Generalised linear models with random effects: a Gibbs sampling approach. Journal of the American Statistical Association, 86, 79–102.MathSciNetCrossRefGoogle Scholar
  94. Zhao, L.P., & Prentice, R.L. (1990). Correlated binary regression using a quadratic exponential model. Biometrika, 77, 642–648.MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2004

Authors and Affiliations

  • Geert Molenberghs
  • Geert Verbeke

There are no affiliations available

Personalised recommendations