Selection bias in linear mixed models
- 138 Downloads
The paper investigates the consequences of sample selection in multilevel or mixed models, focusing on the random intercept two-level linear model under a selection mechanism acting at both hierarchical levels. The behavior of sample selection and the resulting biases on the regression coefficients and on the variance components are studied both theoretically and through a simulation study. Most theoretical results exploit the properties of Normal and Skew-Normal distributions. The analysis allows to outline a taxonomy of sample selection in the multilevel framework that can support the qualitative assessment of the problem in specific applications and the development of suitable techniques for diagnosis and correction.
Key WordsClustered data Multilevel model Random effects Sample selection Skew-Normal distributions Truncation
Unable to display preview. Download preview PDF.
- Borgoni, R. and Billari, F. C. (2002) A multilevel sample selection probit model with an application to contraceptive use, In: Proceedings of the XLI meeting of the Italian Statistical Society, Padova, CLEUP.Google Scholar
- Goldstein, H. (2003) Multilevel statistical models, (3rd), New York, Oxford University Press.Google Scholar
- Grilli, L. and Rampichini, C. (2007b) Selection bias in linear mixed models, Electronic Working Papers of the Department of Statistics, No. 2007/10, University of Florence.Google Scholar
- Jensen, P., Rosholm, M. and Verner, M. (2002) A Comparison of different estimators for panel data sample selection models, Economics Working Paper No. 2002-1, University of Aarhus.Google Scholar
- Littell, R., Milliken, G., Stroup, W., Wolfinger, R. and Schabenberber, O. (2006) SAS for Mixed Models, Second Edition, Cary, SAS Institute Inc.Google Scholar
- Little, R. J. A. and Rubin, D. B. (2002) Statistical analysis with missing data, (2nd), New York, Wiley.Google Scholar
- Saha, C. and Jones, M. P. Asymptotic bias in the linear mixed effects model under non-ignorable missing data mechanisms, Journal of the Royal Statistical Society B, 67, 167–182.Google Scholar