Mathematical Approaches to Analysing Area-Level Effects on Health



This chapter discusses how and why multilevel modelling is a flexible and powerful tool for analysing data with a hierarchical structure. Such data are often found in social science and public health research (e.g. when analysing pupils who are nested in classes or schools, patients in clinics or hospitals).

The aim of multilevel modelling is to integrate the regression equation at a lower level of data grouping (usually individuals) with that at higher levels (such as classes, schools, neighbourhoods) into one regression equation and to incorporate covariates at appropriate levels. By using multilevel models, it is possible to adjust for similarity of the lower level units belonging to the same group of a higher level and to make overall inferences about relationships between lower level as well as higher level characteristics and the outcome of interest.

Using example data sets, we explain step by step how to conduct linear or logistic multilevel modelling. Additionally, we provide syntax commands for several software packages and demonstrate how to interpret the results of multilevel analyses.


Physical Activity Academic Performance Multilevel Modelling Active Transportation High Level Characteristic 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Bates, D., Maechler, M., & Bolker, B. (2011). lme4: Linear mixed-effects models using S4 classes. 0.999375-42 ed. p. R package. Accessed February 11, 2013, from
  2. Bijleveld, C. C. J. H., & van der Kamp, L. J. T. (Eds.). (1998). Longitudinal data analysis. Designs, models and methods. London: Sage.Google Scholar
  3. Diez-Roux, A. V. (2000). Multilevel analysis in public health research. Annual Review of Public Health, 21, 171–192.PubMedCrossRefGoogle Scholar
  4. Diez-Roux, A. V., & Aiello, A. E. (2005). Multilevel analysis of infectious diseases. The Journal of Infectious Diseases, 191(suppl 1), 25–33.CrossRefGoogle Scholar
  5. Dunson, D. (Ed.). (2008). Random effect and latent variable model selection (1st ed.). New York: Springer.Google Scholar
  6. Freedman, D. A. (2004). The ecological fallacy. In M. Lewis-Beck, A. Bryman, & T. F. Liao (Eds.), Encyclopedia of social science research methods. London: Sage.Google Scholar
  7. Graduate School of Education, Bristol Institute of Public Affairs. (2012). Centre for multilevel modelling. University-of-Bristol. AccessedFebruary 24, 2012, from
  8. Hox, J. (2002). Multilevel analysis. Techniques and applications. Mahwah, NJ: Lawrence Erlbaum Associates, Inc.Google Scholar
  9. Hox, J., & Maas, C. J. M. (2004). The influence of violations of assumptions on multilevel parameter estimates and their standard errors. Computational Statistics & Data Analysis, 46, 427–440.CrossRefGoogle Scholar
  10. Inagami, S., Cohen, D. A., Brown, A. F., & Asch, S. M. (2009). Body mass index, neighborhood fast food and restaurant concentration and car ownership. Journal of Urban Health, 86(5), 683–695.PubMedCrossRefGoogle Scholar
  11. Kleinbaum, D. G., & Klein, M. (2010). Logistic regression. A self-learning text (3rd ed.). New York: Springer.CrossRefGoogle Scholar
  12. Koren, A., & Mawn, B. (2010). The context of unintended pregnancy among married women in the USA. Journal of Family Planning and Reproductive Health Care, 36(3), 150–158.PubMedCrossRefGoogle Scholar
  13. Kothari, A. R., & Birch, S. (2004). Multilevel health promotion research: conceptual and analytical considerations. The Canadian Journal of Nursing Research, 36(1), 56–75.PubMedGoogle Scholar
  14. Kreft, I., & De Leeuw, J. (1998). Introducing multilevel modeling. London: Sage.Google Scholar
  15. Larsen, K., & Merlo, J. (2005). Appropriate assessment of neighbourhood effects on individual health: Integrating random and fixed effects in multilevel logistic regression. American Journal of Epidemiology, 161, 81–88.PubMedCrossRefGoogle Scholar
  16. Lehmann, E. L. (1986). Testing statistical hypotheses (2nd ed.). New York: Wiley.CrossRefGoogle Scholar
  17. Matthews, S. A., & Yang, T.-C. (2010). Exploring the role of the built and social neighborhood environment in moderating stress and health. Annals of Behavioral Medicine, 39, 170–183.PubMedCrossRefGoogle Scholar
  18. Merlo, J., Chaix, B., Ohlsson, H., Beckman, A., Johnell, K., Hjerpe, P., et al. (2006). A brief conceptual tutorial of multilevel analysis in social epidemiology: Using measures of clustering in multilevel logistic regression to investigate contextual phenomena. Journal of Epidemiology and Community Health, 60, 290–297.PubMedCrossRefGoogle Scholar
  19. Montgomery, D. C., & Peck, E. A. (1992). Introduction to linear regression analysis. New York: Wiley.Google Scholar
  20. Norman, C. D., Maley, O., Li, X., & Skinner, H. A. (2008). Using the internet to assist smoking prevention and cessation in schools: A randomized, controlled trial. Health Psychology, 27(6), 799–810.PubMedCrossRefGoogle Scholar
  21. Petree, R. D., Broome, K. M., & Bennett, J. B. (2012). Exploring and reducing stress in young restaurant workers: Results of a randomized field trial. American Journal of Health Promotion, 26(4), 217–224.PubMedCrossRefGoogle Scholar
  22. Pinheiro, J., Bates, D., DebRoy, S., & Sarkar, D., R-Development-Core-Team. (2011). nlme: Linear and nonlinear mixed effects models. R package version 3.1-102.Google Scholar
  23. R Development Core Team. (2011). R: A language and environment for statistical computing., Vienna.
  24. Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models. Applications and data analysis methods (2nd ed.). London: Sage.Google Scholar
  25. SAS System for Windows. (2008). (9.2 ed.). Cary, NC: SAS Institute Inc.Google Scholar
  26. Snijders, T. A. B., & Bosker, R. J. (1999). Multilevel analysis. An introduction to basic and advanced multilevel modeling. London: Sage.Google Scholar
  27. SPSS for Windows. (2011). (20.0.0 ed.). Chicago: SPSS Inc.Google Scholar
  28. StataCorp. (2007). Stata statistical software, Stata/IC (101st ed.). College Station, TX: StataCorp LP.Google Scholar
  29. Stock, C., Bloomfield, K., Ejstrud, B., Vinther-Larsen, M., Meijer, M., Grønbæk, M., et al. (2011). Are characteristics of the school district associated with active transportation to school in Danish adolescents? European Journal of Public Health, 22(3), 398–404.PubMedCrossRefGoogle Scholar
  30. Torsheim, T., Currie, C., Boyce, W., Kalnins, I., Overpeck, M., & Haugland, S. (2004). Material deprivation and self-rated health: A multilevel study of adolescents from 22 European and North American countries. Social Science & Medicine, 59, 1–12.CrossRefGoogle Scholar
  31. Venables, W. N., & Ripley, B. D. (2002). Modern applied statistics with S. New York: Springer.CrossRefGoogle Scholar
  32. Verbeke, G., & Molenberghs, G. (2000). Linear mixed models for longitudinal data. New York: Springer.Google Scholar
  33. Victor, R. G., Ravenell, J. E., Freeman, A., Bhat, D. G., Storm, J. S., Shafig, M., et al. (2009). A barber-based intervention for hypertension in African-American men: Design of a group randomized trial. American Heart Journal, 157(1), 30–36.PubMedCrossRefGoogle Scholar
  34. Zuur, A. F., Ieno, E. N., Walker, N. J., Saveliev, A. A., & Smith, G. M. (Eds.). (2009). Mixed effects models and extensions in ecology with R. New York: Springer.Google Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  1. 1.Department for Biostatistics and Clinical EpidemiologyCharité-University MedicineBerlinGermany
  2. 2.Center for Alcohol and Drug Research, School of Business and Social ScienceAarhus UniversityCopenhagen SDenmark

Personalised recommendations