Multilevel Analysis in Higher Education Research: A Multidisciplinary Approach

Cheslock, John J.; Rios-Aguilar, Cecilia

doi:10.1007/978-94-007-0702-3_3

Part of the book series: Higher Education: Handbook of Theory and Research (HATR, volume 26)

Abstract

Within higher education research, both hierarchical linear models (HLM) and econometric panel models are commonly employed in studies examining multilevel data. These two statistical traditions are interesting to compare, because despite a number of underlying similarities, they differ in complementary but often confusing ways. The confusion arises from varying terminology and model presentation, which makes almost identical models appear different. Econometrics textbooks focus on how the multilevel structure can be exploited to advance overall causal inference, while HLM texts primarily highlight opportunities to examine heterogeneity across groups. This chapter highlights the core similarities between these two traditions so that HLM-trained researchers can use their existing knowledge base to read econometric-based articles and vice versa. By contrasting these approaches, this chapter helps applied higher education researchers learn the full range of benefits allowed by the advanced analysis of multilevel data.

Notes

1.
To our knowledge, only one previous paper has devoted considerable space to directly comparing statistical models from these two specific traditions (HLM and econometrics), and that paper was prepared for a conference and not disseminated within the education research community (Chaplin, 2003).
2.
Of course, knowledge is only one of the challenges faced by higher education researchers when choosing the methodological approach in which to specialize. Students within higher education programs often face difficulty gaining access to graduate methodological classes in particular fields, and some programs, such as economics, teach in a manner that requires high levels of prior mathematical training. For these and several other reasons, higher education programs have historically funneled their students into methodology courses taught within educational psychology.
3.
Pooled OLS simply means an OLS regression that examines a data set that combines data from a number of different groups.
4.
This prediction clearly assumes that parental income is likely to be correlated with the propensity of students to be on the margin of attendance. Because price responsiveness and the number of submitted applications vary by a student’s parental income, such an assumption is realistic.
5.
To understand what we mean by a “small number,” consider the sample size requirements for statistical analysis in general. Now apply those requirements to each group in the analysis.
6.
The fourth benefit connects to multiple elements of Shadish, Cook, and Campbell’s validity typology. Because the upcoming discussion focuses on the trade-offs between validity types, we will focus solely on the first three benefits.
7.
For these points, our focus on simplicity is obscuring some important technical details. The HLM framework does not include direct analysis of the variation in \(\beta_{0j}\) and \(\beta_{1j}\) using equations (1)–(3). Instead, this variation is measured by estimating a different set of equations, which do not contain \(W_j\) in equations (2) and (3). Raudenbush and Bryk (2002, pp. 77–80) call this restricted version the “random coefficients” model. Using this model, one can examine the variation in \(\beta_{0j}\) by estimating the variance of \(u_{0j}\). The variation in \(\beta_{1j}\) can be examined by estimating the variance of \(u_{1j}\).
8.
Within HLM, the within-group estimator can also be obtained by including group means of each level-1 variable in each level-2 equation. In other words, we would add \(\overline{X}_j\) as an independent variable to equations (3) and (4). Raudenbush and Bryk (2002) also discuss this approach.
9.
One can also think of the fixed-effects model as adding a dummy variable for each group. From this perspective, we are employing a fixed-effects model whenever we add dummy variables for any classification in which each observation is in one, but no more than one, category. Older versions of the Carnegie classification would be a good example from higher education research. For a multilevel data set with a large number of groups, the addition of dummy variables for each group creates computational challenges, which is why equation (10) is used instead.
10.
The coverage of other multilevel data structures is slightly larger in more advanced econometric textbooks, such as Wooldridge (2002). These books, however, still place a much larger emphasis on panel data. Individual journal articles, such as Moulton (1990) and Wooldridge (2003), focus on cluster samples in much more depth. These articles as well as advanced econometrics textbooks assume a strong existing knowledge base in econometrics and mathematics, so their usefulness will vary considerably across researchers.
11.
There are some prominent econometric papers, however, that focus on our third benefit of multilevel data. For example, Rivkin et al. (2005) used a data set containing students nested within teachers to estimate the variation in student test scores across teachers.
12.
Some papers that use alternative multilevel structures may also possess more realistic assumptions than the three papers we reviewed. As noted earlier in the chapter, difference-in-differences models can produce compelling results in certain contexts (Dynarski, 2000; Cornwell, Mustard, & Sridhar, 2006). Analysis of within-family differences can also be convincing, because siblings possess a number of shared traits and experiences (Ashenfelter & Rouse, 1998). Both difference-in-differences models and sibling studies are very common in economics.
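
Notes 3, 8, and 9 describe several estimators in words. As a minimal illustrative sketch (not taken from the chapter), the Python code below simulates a two-level data set and compares pooled OLS, the within-group estimator, the dummy-variable form of the fixed-effects model, and a regression that adds the group mean of X as a regressor. The simulated data, the variable names, the true slope of 2.0, and the use of plain OLS through NumPy are all assumptions made for illustration. In particular, HLM software would typically estimate the group-mean specification by maximum likelihood rather than OLS.

```python
import numpy as np


def simulate(n_groups=50, n_per_group=20, beta=2.0, seed=0):
    """Simulate observations nested in groups (hypothetical data).

    X is correlated with an unobserved group effect, so pooled OLS
    is expected to give a biased estimate of beta.
    """
    rng = np.random.default_rng(seed)
    group = np.repeat(np.arange(n_groups), n_per_group)
    alpha = rng.normal(size=n_groups)            # unobserved group effect
    x = alpha[group] + rng.normal(size=group.size)
    y = beta * x + 3.0 * alpha[group] + rng.normal(size=group.size)
    return group, x, y


def ols(X, y):
    """Least-squares coefficients for the design matrix X."""
    return np.linalg.lstsq(X, y, rcond=None)[0]


def group_means(v, group):
    """Group mean of v, repeated for every observation in the group."""
    return (np.bincount(group, weights=v) / np.bincount(group))[group]


group, x, y = simulate()
ones = np.ones_like(x)

# Note 3: pooled OLS ignores the grouping entirely.
b_pooled = ols(np.column_stack([ones, x]), y)[1]

# Within-group estimator: subtract group means from X and Y first.
x_within = x - group_means(x, group)
y_within = y - group_means(y, group)
b_within = ols(x_within[:, None], y_within)[0]

# Note 9: one dummy variable per group (no separate intercept).
dummies = (group[:, None] == np.arange(group.max() + 1)).astype(float)
b_dummy = ols(np.column_stack([x, dummies]), y)[0]

# Note 8 (OLS analogue): add the group mean of X as a regressor.
b_group_mean = ols(np.column_stack([ones, x, group_means(x, group)]), y)[1]

print(f"pooled OLS:        {b_pooled:.3f}")
print(f"within-group:      {b_within:.3f}")
print(f"group dummies:     {b_dummy:.3f}")
print(f"group-mean added:  {b_group_mean:.3f}")
```

Under these assumptions, the last three slopes coincide up to floating-point error, while the pooled OLS slope is pulled away from 2.0 because X is correlated with the unobserved group effect.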

References

Archibald, R., & Feldman, D. (2006). State higher education spending and the tax revolt. Journal of Higher Education, 77, 618–644.
Ashenfelter, O., & Krueger, A. (1994). Estimates of the economic return to schooling from a new sample of twins. American Economic Review, 84(5), 1157–1173.
Ashenfelter, O., & Rouse, C. (1998). Income, schooling, and ability: Evidence from a new sample of identical twins. Quarterly Journal of Economics, 113(1), 253–284.
Baltagi, B. H. (2008). Econometric analysis of panel data (4th ed.). Chichester: Wiley.
Becher, T., & Trowler, P. (2001). Academic tribes and territories: Intellectual enquiry and the cultures of disciplines. Buckingham: Society for Research in Higher Education/Open University Press.
Berliner, D. C. (2002). Comment: Educational research: The hardest science of all. Educational Researcher, 31(8), 18–20.
Breneman, D. (1994). Liberal arts colleges: Thriving, surviving, or endangered? Washington, DC: Brookings Institution.
Bryk, A., & Raudenbush, S. (1992). Hierarchical linear models. Newbury Park, CA: Sage.
Cameron, A., & Trivedi, P. (2005). Microeconometrics: Methods and applications. Cambridge: Cambridge University Press.
Chamberlain, G. (1982). Multivariate regression models for panel data. Journal of Econometrics, 18, 5–46.
Chamberlain, G. (1984). Panel data. In Z. Griliches & M. Intriligator (Eds.), Handbook of econometrics (Vol. 2, pp. 1247–1318). Amsterdam: North-Holland.
Chaplin, D. (2003). Hierarchical linear models: Strengths and weaknesses. Paper prepared for the November 2003 meetings of the Association for Public Policy Analysis and Management.
Cheslock, J., & Rios-Aguilar, C. (2008, November). Reaping (or not) the benefits of hierarchical data. Paper presented at the Association for the Study of Higher Education Meeting, Jacksonville, FL.
Cornwell, C., Mustard, D., & Sridhar, D. (2006). The enrollment effects of merit-based financial aid: Evidence from Georgia’s HOPE program. Journal of Labor Economics, 24(4), 761–786.
Cronbach, L. J. (1976). Research on classrooms and schools: Formulation of questions, design, and analysis, Occasional paper. Stanford, CA: Stanford Evaluation Consortium, Stanford University.
Dale, S., & Krueger, A. (2002). Estimating the payoff to attending a more selective college: An application of selection on observables and unobservables. Quarterly Journal of Economics, 117(4), 1491–1527.
Dedrick, R., et al. (2009). Multilevel modeling: A review of methodological issues and applications. Review of Educational Research, 79(1), 69–102.
Dynarski, S. (2000). Hope for whom? Financial aid for the middle class and its impact on college attendance. National Tax Journal, 53(3), 629–662.
Ethington, C. (1997). A hierarchical linear modeling approach to studying college effects. In J. Smart (Ed.), Higher education: Handbook of theory and research (Vol. 12, pp. 165–194). New York: Agathon Press.
Gelman, A., & Hill, J. (2007). Data analysis using regression and multilevel/hierarchical models. Cambridge: Cambridge University Press.
Gelman, A., & Little, T. (1997). Poststratification into many categories using hierarchical logistic regression. Survey Methodology, 23, 127–135.
Gelman, A., Shor, B., Bafumi, J., & Park, D. (2007). Rich state, poor state, red state, blue state: What’s the matter with Connecticut? Quarterly Journal of Political Science, 2, 345–367.
Goldhaber, D., & Brewer, D. (1997). Why don’t schools and teachers seem to matter? Assessing the impact of unobservables on educational productivity. Journal of Human Resources, 32(3), 505–523.
Griliches, Z. (1979). Sibling models and data in economics: Beginnings of a survey. Journal of Political Economy, 87(5), S37–S64.
Heck, R., & Thomas, S. (2009). An introduction to multilevel modeling techniques (2nd ed.). New York: Routledge/Taylor and Francis.
Huber, P. (1967). The behavior of maximum-likelihood estimates under non-standard conditions. In Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Berkeley, CA.
Kenny, D., Mannetti, L., Pierro, A., Livi, S., & Kashy, D. (2002). The statistical analysis of data from small groups. Journal of Personality and Social Psychology, 83, 126–137.
Labaree, D. (2003). The peculiar problems of preparing educational researchers. Educational Researcher, 32(4), 13–22.
Langenkamp, A. (2010). Academic vulnerability and resilience during transition to high school: The role of social relationships and district context. Sociology of Education, 83(1), 1–19.
McPherson, M., & Schapiro, M. (1998). The student aid game: Meeting need and rewarding talent in American higher education. Princeton, NJ: Princeton University Press.
Meyer, B. D. (1995). Natural and quasi-experiments in economics. Journal of Business and Economic Statistics, 13(2), 151–161.
Moulton, B. (1990). An illustration of a pitfall in estimating the effects of aggregate variables on micro units. Review of Economics and Statistics, 72(2), 334–338.
Mundlak, Y. (1978a). On the pooling of time series and cross section data. Econometrica, 46, 69–85.
Mundlak, Y. (1978b). Models with variable coefficients: Integration and extension. Annales de l’INSEE, 30–31, 483–509.
Raudenbush, S., & Bryk, A. (1986). A hierarchical model for examining school effects. Sociology of Education, 59(1), 1–17.
Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis methods (2nd ed.). Newbury Park, CA: Sage.
Raudenbush, S., Bryk, A., Cheong, Y., Congdon, R., & du Toit, M. (2004). HLM 6: Linear and nonlinear modeling. Lincolnwood, IL: Scientific Software International Inc.
Rivkin, S., Hanushek, E., & Kain, J. (2005). Teachers, schools, and academic achievement. Econometrica, 73(2), 417–458.
Rogers, W. (1993). Regression standard errors in clustered samples. Stata Technical Bulletin, 13, 19–23.
Shadish, W., Cook, T., & Campbell, D. (2002). Experimental & quasi-experimental designs for generalized causal inference. Boston: Houghton Mifflin.
Smart, J. (2005). Attributes of exemplary research manuscripts employing quantitative analyses. Research in Higher Education, 46(4), 461–477.
Snijders, T. (2005). Power and sample size in multilevel linear models. In B. Everitt & D. Howell (Eds.), Encyclopedia of statistics in behavioral science (Vol. 3, pp. 1570–1573). Chichester: Wiley.
Snijders, T., & Bosker, R. (1999). Multilevel analysis: An introduction to basic and advanced multilevel modeling. Thousand Oaks, CA: Sage.
Stock, J., & Watson, M. (2007). Introduction to econometrics, Addison-Wesley Series in Economics (2nd ed.). Boston: Pearson.
Umbach, P. (2007). How effective are they? Exploring the impact of contingent faculty on undergraduate education. The Review of Higher Education, 30(2), 91–123.
White, H. (1980). A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrica, 48, 817–830.
Wooldridge, J. (2002). Econometric analysis of cross section and panel data. Cambridge, MA: MIT Press.
Wooldridge, J. M. (2003). Cluster-sample methods in applied econometrics. American Economic Review, 93(2), 133–138.
Wooldridge, J. (2009). Introductory econometrics: A modern approach (4th ed.). Mason, OH: South-Western Cengage Learning.
Zhang, L. (2010). The use of panel data models in higher education policy studies. In J. Smart (Ed.), Higher education: Handbook of theory and research (Vol. 25, pp. 307–349). New York: Agathon Press.

Author information

Authors and Affiliations

Center for the Study of Higher Education, Pennsylvania State University, University Park, PA, 16802-3203, USA
John J. Cheslock
Center for the Study of Higher Education, University of Arizona, Tucson, AZ, 85721, USA
Cecilia Rios-Aguilar

Corresponding author

Correspondence to John J. Cheslock.

Editor information

Editors and Affiliations

CEPR, 100 Ball Hall, Memphis, TN, 38152-3570, USA
John C. Smart
Dept. of Educational Policy & Leadership Studies, University of Iowa, Lindquist Center N491, Iowa City, IA, 52242-1529, USA
Michael B. Paulsen

About this chapter

Cite this chapter

Cheslock, J.J., Rios-Aguilar, C. (2011). Multilevel Analysis in Higher Education Research: A Multidisciplinary Approach. In: Smart, J., Paulsen, M. (eds) Higher Education: Handbook of Theory and Research. Higher Education: Handbook of Theory and Research, vol 26. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-0702-3_3

DOI: https://doi.org/10.1007/978-94-007-0702-3_3
Published: 11 March 2011
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-007-0701-6
Online ISBN: 978-94-007-0702-3
eBook Packages: Humanities, Social Sciences and Law; Education (R0)