Abstract
Many fields of study use longitudinal datasets, which usually consist of repeated measurements of a response variable, often accompanied by a set of covariates, for each subject or unit. Such data are difficult to analyze because repeated measurements on the same subject are inherently correlated. For example, one would expect correlation in a patient’s health status over time or in a student’s performance over time. When the responses are correlated, the underlying joint distribution cannot readily be obtained, so there is no closed-form joint likelihood function to work with, as there is in the standard logistic regression model. One remedy, explored in this chapter, is to fit a generalized estimating equations (GEE) logistic regression model to the data. The chapter addresses repeated measures on the sampling unit and shows how the GEE method accommodates missing values within a subject without discarding all of that subject’s data, as well as time-varying predictors in the model. The method requires a large number of subjects and provides estimates of the marginal model parameters. We fit this model in SAS, SPSS, and R, basing our work on the mean-variance relationship methods of Zeger and Liang (Biometrics 42:121–130, 1986a, Biometrics 73:13–22, 1986b) and Liang and Zeger (Biometrika 73:13–22, 1986).
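For orientation, the GEE approach described above can be summarized in the standard notation of Liang and Zeger (1986). This is a sketch only. The symbols are assumptions chosen for illustration and may differ from the chapter's own notation: Y_i is the response vector for subject i, x_ij the covariates at occasion j, R_i(α) a working correlation matrix, and φ a scale parameter.

```latex
% Marginal logistic model for subject i = 1, ..., K at occasion j = 1, ..., n_i
\operatorname{logit}(\mu_{ij}) = x_{ij}^{\top}\beta,
\qquad \mu_{ij} = \mathrm{E}(Y_{ij} \mid x_{ij}),
\qquad \mathrm{Var}(Y_{ij}) = \phi\,\mu_{ij}(1-\mu_{ij})

% Working covariance built from the mean-variance relationship
V_i = \phi\, A_i^{1/2} R_i(\alpha) A_i^{1/2},
\qquad A_i = \operatorname{diag}\{\mu_{ij}(1-\mu_{ij})\}

% Estimating equations solved for \beta (no joint likelihood is required)
\sum_{i=1}^{K} D_i^{\top} V_i^{-1} (Y_i - \mu_i) = 0,
\qquad D_i = \frac{\partial \mu_i}{\partial \beta^{\top}}

% Robust ("sandwich") covariance estimate of \hat\beta
\widehat{\mathrm{Cov}}(\hat\beta) = B^{-1} M B^{-1},
\quad B = \sum_{i=1}^{K} D_i^{\top} V_i^{-1} D_i,
\quad M = \sum_{i=1}^{K} D_i^{\top} V_i^{-1} (Y_i - \hat\mu_i)(Y_i - \hat\mu_i)^{\top} V_i^{-1} D_i
```

Only the mean and the mean-variance relationship are specified, which is why no joint distribution is needed. The sandwich estimate relies on averaging over many subjects, which is consistent with the requirement of a large number of subjects.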
References
Ballinger, G. A. (2004). Using generalized estimating equations for longitudinal data analysis. Organizational Research Methods, 7, 127–150.
Breslow, N. E. (1989). Score tests in overdispersed GLMs. In A. Decarli, B. J. Francis, R. Gilchrist, & G. U. H. Seeber (Eds.), Workshop on statistical modeling (pp. 64–74). New York: Springer.
Davidian, M., & Carroll, R. J. (1987). Variance function estimation. Journal of the American Statistical Association, 82, 1079–1091.
Diggle, P. J., Liang, K. Y., & Zeger, S. L. (1994). Analysis of longitudinal data. New York: Oxford University Press.
Galbraith, S., Daniel, J. A., & Vissel, B. (2010). A study of clustered data and approaches to its analysis. Journal of Neuroscience, 30, 10601–10608.
Gibbons, R. D., & Hedeker, D. H. (1997). Random effects probit and logistic regression models for three-level data. Biometrics, 53, 1527–1537.
Hardin, J. W., & Hilbe, J. M. (2003). Generalized estimating equations. New York: Wiley.
Hu, F. B., Goldberg, J., Hedeker, D., Flay, B. R., & Pentz, M. A. (1998). Comparison of population-averaged and subject-specific approaches for analyzing repeated binary outcomes. American Journal of Epidemiology, 147(7), 694–703.
Liang, K. Y., & Zeger, S. L. (1986). Longitudinal data analysis using generalized linear models. Biometrika, 73(1), 13–22.
McCullagh, P., & Nelder, J. (1989). Generalized linear models (2nd ed.). London: Chapman and Hall.
Pan, W., & Connett, J. E. (2002). Selecting the working correlation structure in generalized estimating equations with application to the lung health study. Statistica Sinica, 12(2), 475–490.
Sullivan Pepe, M., & Anderson, G. L. (1994). A cautionary note on inference for marginal regression models with longitudinal data and general correlated response data. Communications in Statistics—Simulation and Computation, 23(4), 939–951.
Wilson, P. M., & Wilson, J. R. (1992). Environmental influences on adolescent educational aspirations: A logistic transform model. Youth & Society, 24(1), 52–70.
Zeger, S. L., & Liang, K. Y. (1986a). Longitudinal data analysis for discrete and continuous outcomes. Biometrics, 42, 121–130.
Zeger, S. L., & Liang, K. Y. (1986b). Longitudinal data analysis using generalized linear models. Biometrics, 73, 13–22.
Zeger, S. L., & Liang, K. Y. (1992). An overview of methods for the analysis of longitudinal data. Statistics in Medicine, 11(14–15), 1825–1839.
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Wilson, J.R., Lorenz, K.A. (2015). Generalized Estimating Equations Logistic Regression. In: Modeling Binary Correlated Responses using SAS, SPSS and R. ICSA Book Series in Statistics, vol 9. Springer, Cham. https://doi.org/10.1007/978-3-319-23805-0_6
DOI: https://doi.org/10.1007/978-3-319-23805-0_6
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23804-3
Online ISBN: 978-3-319-23805-0
eBook Packages: Mathematics and Statistics (R0)