A Critical Review of Washback Studies: Hypothesis and Evidence

Revisiting EFL Assessment

Part of the book series: Second Language Learning and Teaching (SLLT)

Abstract

This chapter reviews the current understanding of washback effects in language testing and language education from three perspectives: (1) tracing the historical development of the concept of washback in both general education and language education, (2) examining the results of empirical investigations and highlighting the gaps between hypotheses and evidence, and (3) identifying directions and areas for future washback studies.

Corresponding author

Correspondence to Wei Wei.

Copyright information

© 2017 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Wei, W. (2017). A Critical Review of Washback Studies: Hypothesis and Evidence. In: Al-Mahrooqi, R., Coombe, C., Al-Maamari, F., Thakur, V. (eds) Revisiting EFL Assessment. Second Language Learning and Teaching. Springer, Cham. https://doi.org/10.1007/978-3-319-32601-6_4

  • DOI: https://doi.org/10.1007/978-3-319-32601-6_4

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32599-6

  • Online ISBN: 978-3-319-32601-6

  • eBook Packages: Education, Education (R0)
