Abstract
A good aphorism can, in a few words, capture an essential truth. Of the many good aphorisms Paul Holland has coined over the years, I have found myself invoking the one above frequently enough to worry that I should be paying out royalty fees, so it is only fitting that I use it as the starting point for some ideas I wish to explore in this paper.
Notes
1. It would also be possible to compare a single tutoring program to a control condition of no tutoring, but this comparison would introduce a clear source of bias in the sense that students enrolled in tutoring are likely to be more motivated than those who are not.
References
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1999). Validity. In Standards for educational and psychological testing (pp. 9–24). Washington, DC: American Educational Research Association.
Ballou, D., Sanders, W., & Wright, P. (2004). Controlling for student background in value-added assessment of teachers. Journal of Educational and Behavioral Statistics, 29(1), 37–65.
Borsboom, D., Mellenbergh, G., & van Heerden, J. (2004). The concept of validity. Psychological Review, 111(4), 1061–1071.
Briggs, D. C. (2008). Synthesizing causal inferences. Educational Researcher, 37(1), 15–22.
Briggs, D. C., & Wiley, E. (2008). Causes and effects. In L. Shepard & K. Ryan (Eds.), The future of test-based educational accountability. New York, NY: Routledge.
Burch, P., Steinberg, M., & Donovan, J. (2007). Supplemental educational services and NCLB: Policy assumptions, market practices, emerging issues. Educational Evaluation and Policy Analysis, 29(2), 115–133.
Cronbach, L. (1971). Test validation. In R. L. Thorndike (Ed.), Educational measurement (2nd ed., pp. 443–507). Washington, DC: American Council on Education.
Cronbach, L., & Meehl, P. (1955). Construct validity in psychological tests. Psychological Bulletin, 52, 281–302.
Ferrara, S. (2006). Standardized assessment of individual achievement in K-12. In R. L. Brennan (Ed.), Educational measurement (4th ed., pp. 579–621). Westport, CT: American Council on Education/Praeger.
Holland, P. W. (1986). Statistics and causal inference (with discussion and rejoinder). Journal of the American Statistical Association, 81, 945–970.
Holland, P. W., & Thayer, D. (1988). Differential item performance and the Mantel-Haenszel procedure. In H. Wainer & H. I. Braun (Eds.), Test validity (pp. 129–145). Hillsdale, NJ: Lawrence Erlbaum Associates.
Holland, P. (2004). Evidence for causal inference in education research. Invited session on inference, Evidence and Scientific Research at the annual conference of the American Educational Research Association, San Diego, CA.
Kane, M. (1992). An argument-based approach to validity. Psychological Bulletin, 112(3), 527–535.
Kane, M. (2006). Validation. In R. L. Brennan (Ed.), Educational measurement (4th ed., pp. 17–64). Westport, CT: American Council on Education/Praeger.
Koretz, D., & Hamilton, L. (2006). Testing for accountability in K–12. In R. L. Brennan (Ed.), Educational measurement (4th ed., pp. 531–578). Westport, CT: American Council on Education/Praeger.
Linn, R. (2006). Validity and reliability of student assessment results. Unpublished manuscript.
Lockwood, J. R., McCaffrey, D. F., Hamilton, L. S., Stecher, B., Le, V., & Martinez, J. F. (2007). The sensitivity of value-added teacher effect estimates to different mathematics achievement measures. Journal of Educational Measurement, 44(1), 47–68.
McCaffrey, D. F., Lockwood, J. R., Koretz, D., Louis, T. A., & Hamilton, L. (2004). Models for value-added modeling of teacher effects. Journal of Educational and Behavioral Statistics, 29(1), 67–101.
Messick, S. (1989). Validity. In R. Linn (Ed.), Educational measurement (3rd ed., pp. 13–103). New York, NY: American Council on Education/Macmillan.
No Child Left Behind Act of 2001, Pub. L. No. 107–110, 115 Stat. 1425 (2002).
OMNI Institute. (2008). Evaluation of supplemental educational services: 2006–07 academic year data. Unpublished manuscript.
Ridgway, J., & Schoenfeld, A. H. (1994). Balanced assessment: Designing assessment schemes to promote desirable change in mathematics education. Keynote paper for the EARLI Email Conference on Assessment.
Ridgway, J., Zawojewski, J., & Hoover, M. (2000). Problematising evidence-based policy and practice. Evaluation and Research in Education, 14(3–4), 181–192.
Rubin, D., Stuart, E., & Zanutto, E. (2004). A potential outcomes view of value-added assessment in education. Journal of Educational and Behavioral Statistics, 29(1), 103–116.
Sanders, W. L., Saxton, A. M., & Horn, S. P. (1997). The Tennessee value-added assessment system: A quantitative, outcomes-based approach to educational measurement. In J. Millman (Ed.), Grading teachers, grading schools: Is student achievement a valid evaluation measure? (pp. 137–162). Thousand Oaks, CA: Corwin Press.
Shepard, L. (1993). Evaluating test validity. Review of Research in Education, 19, 405–450.
Swaminathan, H., & Rogers, H. J. (1990). Detecting differential item functioning using logistic regression procedures. Journal of Educational Measurement, 27(4), 361–370.
The SAS Corporation. (n.d.). SAS® EVAAS® for K–12. Retrieved from http://www.sas.com/govedu/edu/k12/evaas/index.html
U.S. Department of Education. (2004). Standards and assessments peer review guidance: Information and examples for meeting requirements of the No Child Left Behind Act of 2001. Washington, DC: Author.
Vergari, S. (2007). Federalism and market-based education policy: The supplemental educational services mandate. American Journal of Education, 113, 311–339.
Wilson, M. (2005). Constructing measures: An item response modeling approach. Mahwah, NJ: Lawrence Erlbaum Associates.
Copyright information
© 2011 Springer Science+Business Media, LLC
About this paper
Cite this paper
Briggs, D.C. (2011). Cause or Effect? Validating the Use of Tests for High-Stakes Inferences in Education. In: Dorans, N., Sinharay, S. (eds) Looking Back. Lecture Notes in Statistics, vol 202. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-9389-2_8
DOI: https://doi.org/10.1007/978-1-4419-9389-2_8
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-9388-5
Online ISBN: 978-1-4419-9389-2
eBook Packages: Mathematics and Statistics; Mathematics and Statistics (R0)