A Note on the Calculation and Interpretation of the Delta-p Statistic for Categorical Independent Variables
- 380 Downloads
This methodological note illustrates how a commonly used calculation of the Delta-p statistic is inappropriate for categorical independent variables, and this note provides users of logistic regression with a revised calculation of the Delta-p statistic that is more meaningful when studying the differences in the predicted probability of an outcome between two or more groups. Although one cannot fully document the extent to which this error in the current calculation of the Delta-p statistic has spread across the field, the potential for error is far reaching as an increasing number of researchers use logistic regression to study categorical outcomes and as researchers look more closely at the Delta-p statistic as a means to communicate the results of logistic regression models to policy makers and administrators. It is recommended that higher education scholars and institutional researchers use caution when reporting the Delta-p statistic from prior studies and that they adopt the revised calculation of the Delta-p statistic presented in this methodological note when estimating logistic regression models with categorical independent variables.
KeywordsDelta-p statistic Logistic regression Predictor variables Higher education research Multivariate statistics
I would like to thank J. Scott Long, Thomas F. Nelson Laird, Stephen R. Porter, Robert K. Toutkoushian and two anonymous reviewers for commenting on previous drafts of this manuscript. Any errors are my own.
- Agresti, A. (2002). Categorical data analysis (2nd ed.). New Jersey: John Wiley and Sons, Inc.Google Scholar
- Cabrera, A. F. (1994). Logistic regression analysis in higher education: An applied perspective. In J. C. Smart (Ed.), Higher education: Handbook of theory and research (Vol. X, pp. 225–256). New York: Agathon Press.Google Scholar
- Cheng, S., & Long, J. S. (2000). XPost: Excel workbooks for the post-estimation interpretation of regression models for categorical dependent variables. Retrieved June 10, 2008 from http://www.indiana.edu/~jslsoc/web_spost/sp_xpost.htm
- Kauffman, R. L. (1996). Comparing effects in dichotomous logistic regression: A variety of standardized coefficients. Social Science Quarterly, 77, 90–109.Google Scholar
- Long, J. S. (1997). Regression models for categorical and limited dependent variables. Thousand Oaks, CA: Sage Publications, Inc.Google Scholar
- Long, J. S., & Freese, J. (2003). Regression models for categorical dependent variables using Stata. College Station, TX: Stata Press.Google Scholar
- Raudenbush, S. W., & Bryk, A. S. (2002). Hierarchical linear models: Applications and data analysis methods (2nd). Advanced quantitatie techniques in the social sciences series 1. Thousand Oaks, CA: Sage Publications.Google Scholar
- St. John, E. P., Hu, S., Simmons, A., & Musoba, G. D. (2001). Aptitude vs. merit: What matters in persistence. The Review of Higher Education, 24, 131–152.Google Scholar