Literature Review

Pan, Mingwei

doi:10.1007/978-981-10-0170-3_2

Mingwei Pan²

531 Accesses

Abstract

This chapter reviews the literature pertaining to the present study. As the whole research can be chronologically broken down into three main phases, covering (1) building an argument for embedding nonverbal delivery into speaking assessment, (2) the formulation and (3) the validation of the rating scale for group discussion in formative assessment, this chapter is accordingly organised into five sections, with the first section reviewing nonverbal delivery relating to the first phase, and the other four sections consecutively addressing the related literature concerning rating scale development and validation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
For detailed descriptions of turn, refer to Sacks (1992), Sacks et al. (1974), Oreström (1983).
2.
Gu (2006b) uses the term agent-oriented modelling language (AML), yet he later changes the term to agent-oriented modelling (AOM) because AOM perceives the modelling as a methodology, while AOML emphasises its relation with UML as the modelling metalanguage (Gu 2009).

References

ACTFL. 1986. ACTFL proficiency guidelines. Hasting-on-Hudson: American Council on the Teaching of Foreign Languages.
Google Scholar
ACTFL. 1999. Revised ACTFL proficiency guidelines—Speaking. Yonkers: American Council on the Teaching of Foreign Languages.
Google Scholar
AERA, APA, and NCME. 1985. Standards for educational and psychological tests and manuals. Washington, DC: American Psychological Association.
Google Scholar
AERA, APA, and NCME. 1999. Standards for educational and psychological tests and manuals. Washington, DC: American Psychological Association.
Google Scholar
Alderson, J.C. 1981. Report of the discussion on general language proficiency. In Issues in language testing, ed. J.C. Alderson, and A. Hughes, 87–92. London: The British Council.
Google Scholar
Alderson, J.C. 1991. Bands and scores. In Language testing in the 1990s, ed. J.C. Alderson, and B. North, 71–86. London: Modern English Publications and the British Council.
Google Scholar
Alderson, J.C. (ed.). 2002. Common European Framework of Reference for Languages: learning, teaching, assessment: case studies. Strasbourg: Council of Europe.
Google Scholar
Alderson, J.C. 2010. The Common European Framework of Reference for Language. Invited seminar at Shanghai Jiao Tong University, Shanghai, China, Oct 2010.
Google Scholar
Alderson, J.C., and J. Banerjee. 2002. Language testing and assessment (Part 2). Language Teaching 35(2): 79–113.
Article Google Scholar
Alderson, J.C., N. Figueras, H. Kuiper, and G. Nold. 2006. Analyzing tests of reading and listening in relation to the Common European Framework of Reference: the experience of the Dutch CEFR Construct Project. Language Assessment Quarterly 3(1): 3–30.
Article Google Scholar
Alibali, M.W., L. Flevares, and S. Goldin-Meadow. 1997. Assessing knowledge conveyed in gesture: do teachers have the upper hand? Journal of Educational Psychology 89: 183–193.
Article Google Scholar
Allal, L., and L.M. Lopez. 2005. Formative assessment of learning: a review of publication in French. In Formative assessment: improving learning in secondary classrooms, ed. J. Looney, 241–264. Paris: Organisation for Economic Cooperation and Development.
Google Scholar
Anastasi, A. 1950. Some implications of cultural factors for test construction. New York: Educational Testing Service.
Google Scholar
Anastasi, A. 1954. Psychological testing. New York: Macmillan.
Google Scholar
Anastasi, A. 1961. Psychological testing, 2nd ed. New York: Macmillan.
Google Scholar
Anastasi, A. 1976. Psychological testing, 4th ed. New York: Macmillan.
Google Scholar
Anastasi, A. 1982. “What do intelligence tests measure?” In On educational testing: Intelligence, performance standards, test anxiety, and latent traits, eds. S.B. Anderson, and J.S. Hemlick, 5–28. San Francisco, CA: Jossey-Bass, Inc.
Google Scholar
Angoff, W. 1988. Validity: an evolving concept. In Test validity, ed. H. Wainer, and H.I. Braun, 19–32. Hillsdale: Lawrence Erlbaum Associates.
Google Scholar
APA. 1954. Technical recommendations for psychological tests and diagnostic techniques. Psychological Bulletin Supplement 51(2): 1–38.
Article Google Scholar
APA, AERA, and NCME. 1966. Standards for educational and psychological tests and manuals. Washington, DC: American Psychological Association.
Google Scholar
APA, AERA, and NCME. 1974. Standards for educational and psychological tests and manuals. Washington, DC: American Psychological Association.
Google Scholar
Applebee, A.N. 2000. Alternative models of writing development. In Perspectives on writing: research, theory, practice, ed. R. Indrisano, and J.R. Squire, 90–111. Newark: International Reading Association.
Google Scholar
Argyle, M., and M. Cook. 1976. Gaze and mutual gaze. Cambridge: Cambridge University Press.
Google Scholar
Bacha, N. 2001. Writing evaluation: what can analytic versus holistic essay scoring tell us? System 29: 371–383.
Article Google Scholar
Bachman, L.F. 1988. Problems in examining the validity of the ACTFL oral proficiency interview. Studies in Second Language Acquisition 10(2): 149–164.
Article Google Scholar
Bachman, L.F. 1990. Fundamental considerations in language testing. Oxford: Oxford University Press.
Google Scholar
Bachman, L.F. 1991. What does language testing have to offer? TESOL Quarterly 25(4): 671–704.
Article Google Scholar
Bachman, L.F. 2005. Building and supporting a case for test use. Language Assessment Quarterly 2(1): 1–34.
Article Google Scholar
Bachman, L.F., and A.S. Palmer. 1981. The construct validation of the FSI oral interview. Language Learning 31: 67–86.
Article Google Scholar
Bachman, L.F., and A.S. Palmer. 1982. The construct validation of some components of communicative proficiency. TESOL Quarterly 16(4): 449–465.
Article Google Scholar
Bachman, L.F., and A.S. Palmer. 1989. The construct validation of self-ratings of communicative language ability. Language Testing 6(4): 449–465.
Google Scholar
Bachman, L.F., and A.S. Palmer. 1996. Language testing in practice: designing and developing useful language tests. Oxford: Oxford University Press.
Google Scholar
Bachman, L.F., and A.S. Palmer. 2010. Language assessment in practice: developing language tests and justifying their use the real world. Oxford: Oxford University Press.
Google Scholar
Bachman, L.F., and S.J. Savignon. 1986. The evaluation of communicative language proficiency: a critique of the ACTFL oral interview. Modern Language Journal 70(3): 380–390.
Article Google Scholar
Bachman, L.F., B.M. Lynch, and M. Mason. 1995. Investigating variability in tasks and rater judgments in a performance test of foreign language speaking. Language Testing 12(2): 238–257.
Article Google Scholar
Bae, J., and L.F. Bachman. 1998. A latent variable approach to listening and reading: testing factorial invariance across two groups of children in the Korean/English two-way immersion program. Language Testing 15(3): 380–414.
Google Scholar
Baird, L.L. 1983. The search for communication skills. Educational Testing Service Research Report, No. 83-14. Princeton: Educational Testing Service.
Google Scholar
Baldry, A., and P. Thibault. 2006. Multimodal transcription and text analysis. London: Equinox.
Google Scholar
Barakat, R.A. 1973. Arabic gestures. Journal of Popular Culture 6(4): 749–787.
Article Google Scholar
Barkaoui, K. 2007. Rating scale impact on EFL essay marking: a mixed-method study. Assessing Writing 12(2): 86–107.
Article Google Scholar
Barkaoui, K. 2011. Think-aloud protocols in research on essay rating: an empirical study of their veridicality and reactivity. Language Testing 28(1): 51–75.
Article Google Scholar
Bateman, J.A. 2008. Multimodality and genre: a foundation for the systematic analysis of multimodal documents. London: Palgrave Macmillan.
Book Google Scholar
Bateman, J., J. Delin, and R. Henschel. 2004. Multimodality and empiricism: preparing for a corpus-based approach to the study of multimodal meaning-making. In Perspectives on multimodality, ed. E. Ventola, C. Cassily, and M. Kaltenbacher, 65–88. Philadelphia: John Benjamins.
Chapter Google Scholar
Bateman, J.A., J. Delin, and R. Henschel. 2006. Mapping the multimodal genres of traditional and electronic newspapers. In New directions in the analysis of multimodal discourse, ed. T.D. Royce, and W.L. Bowcher, 147–172. Mahwah: Lawrence Erlbaum Associates.
Google Scholar
Black, P., and D. Wiliam. 1998. Assessment and classroom learning. Assessment in Education 5(1): 7–74.
Article Google Scholar
Black, P., and D. Wiliam. 2009. Developing the theory of formative assessment. Educational Measurement, Evaluation and Accountability 21(1): 5–31.
Article Google Scholar
Bloom, B.S., J.T. Hasting, and G.F. Madaus (eds.). 1971. Handbook of formative and summative evaluation of student learning. New York: McGraw-Hill.
Google Scholar
Bonk, W.J., and G.J. Ockey. 2003. A many-facet Rasch analysis of the second language group oral discussion task. Language Testing 20(1): 89–110.
Article Google Scholar
Bourne, J., and C. Jewitt. 2003. Orchestrating debate: a multimodal approach to the study of the teaching of higher order literacy skills. Reading: Literacy and Language, UKRA, July, 64–72.
Google Scholar
Brindley, G. 1986. The assessment of second language proficiency: issues and approaches. Adelaide: National Curriculum Resource Centre.
Google Scholar
Brindley, G. 1991. Defining language ability: the criteria for criteria. In Current developments in language testing, ed. S. Anivan, 139–164. Singapore: Regional Language Centre.
Google Scholar
Brindley, G. 2002. Issues in language assessment. In The Oxford handbook of applied linguistics, ed. R.B. Kaplan, 459–470. Oxford: Oxford University Press.
Google Scholar
Brookhart, S.M. 2004. Classroom assessment: tensions and intersection in theory and practice. Teachers College Record 106(3): 429–458.
Article Google Scholar
Brookhart, S.M. 2007. Expanding views about formative classroom assessment: a review of the literature. In Formative classroom assessment: theory into practice, ed. J.H. McMillan, 43–62. New York: Teachers College Press.
Google Scholar
Brooks, L. 2009. Interacting in pairs in a test of oral proficiency: co-constructing a better performance. Language Testing 26(3): 341–366.
Article Google Scholar
Brown, A. 2003. Interviewer variation and the co-construction of speaking proficiency. Language Testing 20(1): 1–25.
Article Google Scholar
Brown, A., N. Iwashita, and T. McNamara. 2005. An examination of rater orientations and test taker performance on English for academic purposes speaking tasks. TOEFL Monograph Series, No. TOEFL-MS-29. Princeton: Educational Testing Service.
Google Scholar
Brown, J.D., and K.M. Bailey. 1984. A categorical instrument for scoring second writing skills. Language Learning 34(1): 21–42.
Article Google Scholar
Brown, J.D., and T. Hudson. 1998. The alternatives in language assessment. TESOL Quarterly 32(4): 653–675.
Article Google Scholar
Brown, G., and G. Yule. 1983. Discourse analysis. Cambridge: Cambridge University Press.
Book Google Scholar
Brumfit, C.J. 1984. Communicative methodology in language teaching: the roles of fluency and accuracy. Cambridge: Cambridge University Press.
Google Scholar
Brumfit, C.J., and K. Johnson. 1979. The communicative approach to language teaching. Oxford: Oxford University Press.
Google Scholar
Burgoon, J.K., and T. Saine. 1978. The unspoken dialogue: an introduction to nonverbal communication. Boston: Hughton Mifflin Company.
Google Scholar
Burgoon, J.K., D.A. Coker, and R.A. Coker. 1986. Communicative effects of gaze behavior: a test of two contrasting explanations. Human Communication Research 12: 495–524.
Article Google Scholar
Campbell, D.T., and D.W. Fiske. 1959. Convergent and discriminant validation by the multi-trait multi-method matrix. Psychological Bulletin 56: 81–105.
Article Google Scholar
Canale, M. 1983. From communicative competence to communicative language pedagogy. In Language and communication, ed. J.C. Richards, and R.W. Schmidt, 2–27. London: Longman.
Google Scholar
Canale, M., and M. Swain. 1980. Theoretical bases of communicative approaches to second language teaching and testing. Applied Linguistics 1(1): 1–47.
Article Google Scholar
Candlin, C.N. 1986. Explaining communicative competence limits of testability? In Toward communicative competence testing: proceedings of the second TOEFL invitational conference, ed. C.W. Stansfield, 38–57. Princeton: Educational Testing Service.
Google Scholar
Caple, H. 2008. Intermodal relations in image nuclear news stories. In Multimodal semiotics: functional analysis in contexts of education, ed. L. Unsworth, 125–138. London: Continuum.
Google Scholar
Carroll, J.B. 1961. The nature of data, or how to choose a correlation coefficient. Psychometrika 35(4): 347–372.
Article Google Scholar
Carroll, J.B. 1968. The psychology of language testing. In Language testing symposium: a psycholinguistic perspective, ed. A. Davies, 46–69. London: Oxford University Press.
Google Scholar
Celce-Murcia, M., Z. Dörneyei, and S. Thurrell. 1997. Direct approaches in L2 instruction: a turning point in communicative language teaching? TESOL Quarterly 31(1): 141–152.
Article Google Scholar
Cerrato, L. 2005. Linguistic functions of head nods. In Gothenburg papers in theoretical linguistics 92: proceedings from 2nd Nordic conference on multi-modal communication, ed. J. Allwood, and B. Dorriots, 137–152. Sweden: Gothenburg University.
Google Scholar
Chafe, W. 1994. Discourse, consciousness, and time: The flow and displacement of conscious experience in speaking and writing. Chicago: University of Chicago Press.
Google Scholar
Chalhoub-Deville, M. 1995. Deriving oral assessment scales across different tests and rater groups. Language Testing 12(1): 16–33.
Article Google Scholar
Chapelle, C.A. 1998. Field independence: a source of language test variance? Language Testing 15(1): 62–82.
Google Scholar
Chapelle, C.A. 1999. Validity in language assessment. Annual Review of Applied Linguistics 19: 254–272.
Article Google Scholar
Chapelle, C.A., M.K. Enright, and J. Jamieson (eds.). 2008. Building a validity argument for the Test of English as a Foreign Language. New York: Routledge.
Google Scholar
Chapelle, C.A., M.K. Enright, and J. Jamieson. 2010. Does an argument-based approach to validity make a difference? Educational Measurements: Issues and Practice 29(1): 3–13.
Google Scholar
Charney, D. 1984. The validity of using holistic scoring to evaluate writing: a critical overview. Research in the Teaching of English 18(1): 65–81.
Google Scholar
Chen, R. 2008. Some words on writing a multimodal lesson ware for English teaching. Journal of Fujian Education Institute 1: 75–77.
Google Scholar
Chen, Y., and G. Huang. 2009. Multimodal construal of heteroglossia: evidence from language textbooks. Computer Assisted Foreign Language Education 6: 35–41.
Google Scholar
Chen, Y., and H. Wang. 2008. Ideational meaning of image and text-image relations. Journal of Ningbo University (Education Edition) 1: 124–129.
Google Scholar
Cheng, L. 2005. Changing language teaching through language testing: a washback study. Cambridge: Cambridge University Press.
Google Scholar
Chomsky, N. 1965. Aspects of the theory of syntax. Cambridge: MIT Press.
Google Scholar
Cienki, A. 2008. Why study metaphor and gesture? In Metaphor and Gesture, eds. A. Cienki and C. Müller, 5–26. Amsterdam/Philadelphia: John Benjamins Publishing Company.
Google Scholar
Cizek, G.J. 2010. An introduction to formative assessment: history, characteristics and challenges. In Handbook of formative assessment, ed. H.L. Andrade, and G.J. Cizek, 3–17. New York: Routledge.
Google Scholar
Clark, J.L. 1985. Curriculum renewal in second language learning: an overview. Canadian Modern Language Review 42(3): 342–360.
Google Scholar
Clarkson, R., & M.T. Jensen. 1995. Assessing achievement in English for professional employment programmes. In Language assessment in action, ed. G. Brindley, pp. 165–194. Sydney, Macquarie University: National Centre for English Language Teaching and Research.
Google Scholar
Cohen, A. 1994. Assessing language ability in the classroom, 2nd ed. Boston: Heinle and Heinle Publishers.
Google Scholar
Connor, U., and P.L. Carrel. 1993. The interpretation of the tasks by writers and readers in holistically rated directed assessment of writing. In Reading in the composition classroom: second language perspectives, ed. J.G. Carson, and I. Leki, 141–160. Boston: Heine & Heine.
Google Scholar
Connor, U., and A. Mbaye. 2002. Discourse approaches to writing assessment. Annual Review of Applied Linguistics 22: 263–278.
Article Google Scholar
Cooper, C.R. 1977. Holistic evaluation of writing. In Evaluating writing: describing, measuring, judging, ed. C.R. Cooper, and L. Odell, 3–31. Urbana: NCTE.
Google Scholar
Corder, S.P. 1983. Strategies of communication. In Strategies in interlanguage communication, ed. C. Færch, and G. Kasper, 15–19. London: Longman.
Google Scholar
Cortazzi, M. 1993. Narrative analysis. London: Falmer Press.
Google Scholar
Council of Europe. 2001. Common European framework of reference for languages: learning, teaching, assessment. Cambridge: Cambridge University Press.
Google Scholar
Cowie, B., and B. Bell. 1999. A model of formative assessment in science education. Assessment in Education 6(1): 102–116.
Google Scholar
Creider, C. 1977. Towards a description of East African gestures. Sign Language Studies 14: 1–20.
Article Google Scholar
Cronbach, L.J. 1949. Essentials of psychological testing. New York: Harper & Row.
Google Scholar
Cronbach, L.J. 1971. Test validation. In Educational measurement, 2nd ed, ed. R.L. Thorndike, 443–507. Washington, DC: American Council on Education.
Google Scholar
Cronbach, L.J. 1980. Validity on parole: how can we go straight? New directions for testing and assessment: Measuring achievement over a decade. Proceedings of the 1979 ETS invitational conference, pp. 99–108. San Francisco: Jossey-Bass.
Google Scholar
Cronbach, L.J. 1988. Five perspectives on validity argument. In Test validity, ed. H. Wainer, and H.I. Braun, 3–17. Hillsdale: Lawrence Erlbaum Associates.
Google Scholar
Cronbach, L.J. 1989. Construct validation after thirty years. In Intelligence: measurement, theory, and public policy, ed. R. Linn, 147–167. Urbana: University of Chicago.
Google Scholar
Cronbach, L.J., and P.C. Meehl. 1955. Construct validity in psychological tests. Psychological Bulletin 52(4): 281–302.
Article Google Scholar
Cumming, A. 1990. Expertise in evaluating second language composition. Language Testing 7(1): 31–51.
Article Google Scholar
Cumming, A., R. Kantor, and D.E. Powers. 2001. Scoring TOEFL essays and TOEFL 2000 prototype writing tasks: an investigation into raters’ decision making and development of a preliminary analytic framework. TOEFL Monograph Series, No. TOEFL-MS-22. Princeton: Educational Testing Service.
Google Scholar
Cumming, A. 2009. Language assessment in education: tests, curricula and teaching. Annual Review of Applied Linguistics 29: 90–100.
Article Google Scholar
Cumming, A., R. Kantor, and D.E. Powers. 2002. Decision making while rating ESL/EFL writing tasks: a descriptive framework. Modern Language Journal 86: 67–96.
Article Google Scholar
Cumming, A., R. Kantor, K. Baba, U. Erdosy, K. Eouanzoui, and M. James. 2006. Analysis of discourse features and verification of scoring levels for independent and integrated tasks for the new TOEFL. Princeton: Educational Testing Service.
Google Scholar
Cureton, E.E. 1950. Validity. In Educational measurement, ed. E.F. Lingquist, 621–694. Washington, DC: American Council on Education.
Google Scholar
Daly, A., and L. Unsworth. 2011. Analysis and comprehension of multimodal texts. Australian Journal of Language and Literacy 34(1): 61–80.
Google Scholar
Daniels, H. 2001. Vygotsky and pedagogy. London: Routledge.
Google Scholar
Davidson, F., and B. Lynch. 2002. Testcraft: a teacher’s guide to writing and using language test specifications. New Haven: Yale.
Google Scholar
Davies, A., and P. LeMahieu. 2003. Assessment for learning: reconsidering portfolio and research evidence. In Optimising new modes of assessment: in search of qualities and standards, ed. M. Sergers, F. Dochy, and E. Cascallar, 141–169. Dordrecht: Kluwer Academic Publishers.
Chapter Google Scholar
Davies, A., A. Brown, C. Elder, K. Hill, T. Lumley, and T. McNamara. 1999. Dictionary of language testing. Cambridge: Cambridge University Press.
Google Scholar
Davison, C. 2004. The contradictory culture of teacher-based assessment: ESL assessment practices in Australian and Hong Kong secondary schools. Language Testing 21(3): 305–334.
Article Google Scholar
de Jong, J.H.A.L. 1992. Assessment of language proficiency in the perspective of the 21st century. AILA Review 9: 39–45.
Google Scholar
Derewianka, B., and C. Coffin. 2008. Visual representations of time in history textbooks. In Multimodal semiotics, ed. L. Unsworth, 187–200. London: Continuum.
Google Scholar
Djonov, E.N. 2006. Analysing the organisation of information in websites: from hypermedia design to systemic functional hypermedia discourse analysis. Unpublished Ph.D. thesis, University of New South Wales, Australia.
Google Scholar
Douglas, D., and J. Smith. 1997. Theoretical underpinnings of the Test of Spoken English revision project. TOEFL Monograph Series, No. TOEFL-MS-9. Princeton: Educational Testing Service.
Google Scholar
Douglas, D. 2000. Assessing languages for specific purposes. Cambridge: Cambridge University Press.
Google Scholar
Ducasse, A.M., and A. Brown. 2009. Assessing paired orals: raters’ orientation to interaction. Language Testing 26(3): 423–443.
Article Google Scholar
Dwyer, C.A. 2000. Excerpt from validity: theory into practice. The Score 22(4): 6–7.
Google Scholar
Ebel, R.L. 1961. Must all tests be valid? American Psychologist 16(10): 640–647.
Article Google Scholar
Ebel, R. L., and D. A. Frisbie. 1991. Essentials of educational measurement, 5th ed. Englewood Cliffs, NJ: Prentice—Hall.
Google Scholar
Efron, D. 1941. Gesture, race and culture. The Hague: Mouton.
Google Scholar
Egbert, M.M. 1998. Miscommunication in language proficiency interviews of first-year German students: a comparison with natural conversation. In Talking and testing: discourse approaches to the assessment of oral proficiency, ed. R. Young, and W. He, 147–172. Philadelphia: John Benjamins.
Chapter Google Scholar
Eggins, S., and D. Slade. 1997. Analysing casual conversation. London: Cassell.
Google Scholar
Ekman, P., and W.V. Friesen. 1969. Nonverbal leakage and clues to deception. Psychiatry 32: 88–106.
Google Scholar
Ekman, P., and W.V. Friesen. 1974. Detecting deception from body or face. Journal of Personality and Social Psychology 29: 288–298.
Article Google Scholar
Ellsworth, P.C., and L.M. Ludwig. 1971. Visual behaviour in social interaction. Journal of Communication 21(4): 375–403.
Google Scholar
Enfield, N.J. 2009. The anatomy of meaning: Speech, gesture, and composite utterances. Cambridge: Cambridge University Press.
Google Scholar
Engestrom, Y. 1987. Learning by expanding: an activity theoretical approach to developmental research. Helsinki: Orienta-Konsultit Oy.
Google Scholar
Erdosy, M.U. 2004. Exploring variability in judging writing ability in a second language: a study of four experienced raters of ESL compositions. TOEFL Research Report, No. RR-03-17. Princeton: Educational Testing Service.
Google Scholar
Ericsson, K.A., and H. Simon. 1993. Protocol analysis. Cambridge: MIT Press.
Google Scholar
Færch, C., and G. Kasper (eds.). 1983. Strategies in interlanguage communication. London: Longman.
Google Scholar
Færch, C., et al. 1984. Learner language and language learning. Philadelphia: Multilingual Matters Ltd.
Google Scholar
Feng, D. 2011. Visual space and ideology: a critical cognitive analysis of spatial orientations in advertising. In Multimodal studies: exploring issues and domains, ed. K.L. O’Halloran, and B.A. Smith, 55–75. London: Routledge.
Google Scholar
Folland, D., and D. Robertson. 1976. Towards objective in group oral testing. ELT Journal 30(2): 156–167.
Article Google Scholar
Fulcher, G. 1987. Tests of oral performance: the need for data-based criteria. ELT Journal 41(4): 287–291.
Article Google Scholar
Fulcher, G. 1993. The construction and validation of rating scales for oral tests in English as a foreign language. Unpublished Ph.D. thesis. University of Lancaster, UK.
Google Scholar
Fulcher, G. 1996a. Does thick description lead to smart tests? A data-based approach to rating scale construction. Language Testing 13(2): 208–238.
Article Google Scholar
Fulcher, G. 1996b. Invalidating validity claims for the ACTFL oral rating scale. System 24(2): 163–172.
Article Google Scholar
Fulcher, G. 1997. The testing of speaking in a second language. In Encyclopaedia of language and education, vol. 7, ed. C. Clapham, and D. Corson, 75–85., Language testing and assessment New York: Springer.
Chapter Google Scholar
Fulcher, G. 2003. Testing second language speaking. London: Longman/Pearson Education.
Google Scholar
Fulcher, G. 2004. Deluded by artifices? The Common European Framework and harmonization. Language Assessment Quarterly 1(4): 253–266.
Article Google Scholar
Fulcher, G. 2010. Practical language testing. London: Hodder Education.
Google Scholar
Fulcher, G., and F. Davidson. 2007. Language testing and assessment: an advanced resource book. London: Routledge.
Book Google Scholar
Fulcher, G., F. Davidson, and J. Kemp. 2011. Effective rating scale development for speaking tests: performance decision trees. Language Testing 27(1): 1–25.
Google Scholar
Galloway, V.B. 1987. From defining to developing proficiency: a look at the decisions. In Defining and developing proficiency: guidelines, implementations, and concepts, ed. H. Byrnes, and M. Canale, 25–73. Lincolnwood: National Textbook Company.
Google Scholar
Garrett, H.E. 1947. Statistics in psychology and education, 3rd ed. New York: Longmans, Green & Company.
Google Scholar
Goldin-Meadow, S., and M.A. Singer. 2003. From children’s hands to adults’ ears: Gesture’s role in teaching and learning. Developmental Psychology 39: 509–520.
Article Google Scholar
Goodwin, L.D. 1997. Changing conceptions of measurement validity. Journal of Nursing Education 36: 102–107.
Google Scholar
Goodwin, L.D. 2002. Changing conceptions of measurement validity: an updated on the new standards. Journal of Nursing Education 41: 100–106.
Google Scholar
Goodwin, C., and J.C. Heritage. 1990. Conversation analysis. Annual Review of Anthropology 19: 283–307.
Article Google Scholar
Goodwin, L.D., and N.L. Leech. 2003. The meaning of validity in the new standards for educational and psychological testing: implications for measurement courses. Measurement and Evaluation in Counseling and Development 36(3): 181–191.
Google Scholar
Goulden, N.R. 1992. Theory and vocabulary for communication assessments. Communication Education 41(3): 258–269.
Article Google Scholar
Goulden, N.R. 1994. Relationship of analytic and holistic methods to rater’s scores for speeches. The Journal of Research and Development in Education 27: 73–82.
Google Scholar
Grant, L., and L. Ginther. 2000. Using computer-tagged linguistic features to describe L2 writing differences. Journal of Second Language Writing 9: 123–145.
Article Google Scholar
Green, J.R. 1968. A gesture inventory for the teaching of Spanish. Philadelphia: Chilton Books.
Google Scholar
Green, A. 1998. Verbal protocol analysis in language testing research: a handbook. Cambridge: Cambridge University Press.
Google Scholar
Green, A. 2007. Washback to learning outcomes: a comparative study of IELTS preparation and university pre-sessional language courses. Assessment in Education 14(1): 75–97.
Article Google Scholar
Grierson, J. 1995. Classroom-based assessment in intensive English centres. In Language assessment in action, ed. G. Brindley, 239–270. Sydney: National Centre for English Language Teaching and Research.
Google Scholar
Grootenboer, H. 2006. Treasuring the gaze: eye miniature portraits and the intimacy of vision. Art Bulletin 88(3): 496–507.
Article Google Scholar
Gu, Y. 2006a. Multimodal text analysis: a corpus linguistic approach to situated discourse. Text & Talk 26(2): 127–167.
Article Google Scholar
Gu, Y. 2006b. Agent-oriented modelling language, Part 1: modelling dynamic behaviour. Proceedings of the 20th international CODATA conference, Beijing, pp. 21–47. Beijing: Information Centre, Chinese Academy of Social Sciences.
Google Scholar
Gu, Y. 2007. Learning by multimedia and multimodality. In E-learning in China: Sino-UK initiatives into policy, pedagogy and culture, ed. H. Spencer-Oatey, 37–56. Hong Kong: The Hong Kong University Press.
Google Scholar
Gu, Y. 2009. From real life situated discourse to video-stream data-mining: an argument for agent-oriented modelling for multimodal corpus compilation. International Journal of Corpus Linguistics 14(4): 433–466.
Article Google Scholar
Guijarro, A.J.M., and M.J.P. Sanz. 2009. On interaction of image and verbal text in a picture book: a multimodal and systemic functional study. In The world told and the world shown: multisemiotic issues, ed. E. Ventola, and A.J.M. Guijarro, 107–123. Hampshire: Palgrave Macmillan.
Google Scholar
Guilford, J.P. 1946. New standards for test evaluation. Educational and Psychological Measurement 6(3): 427–438.
Google Scholar
Guion, R.M. 1977. Content validity: the source of my discontent. Applied Psychological Measurement 1(1): 1–10.
Article Google Scholar
Gulliksen, H. 1950. Theory of mental tests. Hillsdale: Lawrence Erlbaum Associates.
Book Google Scholar
Guo, L. 2004. Multimodality in biology textbooks. In Multimodal discourse analysis: systemic-functional perspectives, ed. K.L. O’Halloran, 196–219. London: Continuum.
Google Scholar
Hale, G.A., D.A. Rock, and T. Jirele. 1989. Confirmatory factor analysis of the TOEFL. TOEFL Research Report, No. RR-32. Princeton NJ: Educational Testing Service.
Google Scholar
Hall, E.T. 1959. The silent language. New York: Doubleday.
Google Scholar
Halliday, M.A.K. 1973. Explorations in the functions of language. London: Edward Arnold.
Google Scholar
Halliday, M.A.K. 1976. The form of a functional grammar. In Halliday: system and function in language, ed. G. Kress, 101–135. Oxford: Oxford University Press.
Google Scholar
Halliday, M.A.K. 1978. Language as social semiotic: the social interpretation of language and meaning. London: Edward Arnold.
Google Scholar
Halliday, M.A.K. 1985. An introduction to functional grammar. London: Arnold.
Google Scholar
Halliday, M.A.K., and R. Hasan. 1976. Cohesion in English. London: Longman.
Google Scholar
Halliday, M.A.K., and C.M.I.M. Matthiessen. 2004. An introduction to functional grammar, 3rd ed. London: Edward Arnold.
Google Scholar
Halliday, M.A.K., A. McIntosh, and P. Strevens. 1964. The linguistic sciences and language teaching. Bloomington: Indiana University Press.
Google Scholar
Hamp-Lyons, L. 1990. Second language writing: assessment issues. In Second language writing: research insights for the classroom, ed. B. Kroll, 69–87. New York: Cambridge University Press.
Chapter Google Scholar
Hamp-Lyons, L. 1991. Scoring procedures for ESL contexts. In Assessing second language writing in academic contexts, ed. L. Hamp-Lyons, 241–276. Norwood: Ablex.
Google Scholar
Hamp-Lyons, L. 1997. Washback, impact and validity: ethical concerns. Language Testing 14(3): 295–303.
Article Google Scholar
Hatch, E. 1978. Discourse analysis and second language acquisition. In Second language acquisition: a book of readings, ed. E. Hatch, 401–435. Rowley: Newbury House.
Google Scholar
Hattie, J., and H. Timperley. 2007. The power of feedback. Review of Educational Research 77(1): 81–112.
Article Google Scholar
Hawkey, R. 2001. Towards a common scale to describe L2 writing performance. Cambridge Research Notes 5: 9–13.
Google Scholar
Hawkey, R., and F. Barker. 2004. Developing a common scale for the assessment of writing. Assessing Writing 9(2): 122–159.
Article Google Scholar
He, W. 1998. Answering questions in LPIs: a case study. In Talking and testing: discourse approaches to the assessment of oral proficiency, ed. R. Young, and W. He, 101–116. Philadelphia: John Benjamins.
Chapter Google Scholar
Heath, C.C., and P. Luff. 2007. Gesture and institutional interaction: figuring bids in auctions of fine art and antiques. Gesture 7(2): 215–240.
Article Google Scholar
Hempel, C.G. 1965. Aspects of scientific explanation and other essays in the philosophy of science. Glencoe: Free Press.
Google Scholar
Henley, N.M. 1977. Body politics: power, sex, and nonverbal communication. Englewood Cliffs: Prentice-Hall.
Google Scholar
Henley, N.M., and S. Harmon. 1985. The nonverbal semantics of power and gender: a perceptual study. In Power, dominance, and nonverbal behavior, ed. S.L. Ellyson, and J.F. Dovidio, 151–164. New York: Springer.
Chapter Google Scholar
Herman, J.L., and K. Choi. 2008. Formative assessment and the improvement of middle school science learning: The role of teacher accuracy. CRESST Report 740. Los Angeles, CA: National Center for Research on Evaluation, Standards, and Student Testing.
Google Scholar
Hess, E.H. 1975. The tell-tale eye: how your eyes reveal hidden thoughts and emotions. New York: van Nostrand Reinhold.
Google Scholar
Hilsdon, J. 1995. The group oral exam: advantages and limitations. In Language testing in the 1990s: the communicative legacy, ed. C. Alderson, and B. North, 189–197. Hertfordshire: Prentice Hall International.
Google Scholar
Hood, S. 2004. Managing attitude in undergraduate academic writing: A focus on the introductions to research reports. In Analysing academic writing: Contextualized frameworks, eds. L.J. Ravelli, and R.A. Ellis, 24–44. London: Continuum.
Google Scholar
Hood, S. 2006. The persuasive power of prosodies: Radiating values in academic writing. Journal of English for Academic Purposes, 5(1):37–49.
Google Scholar
Hood, S.E. 2007. Gesture and meaning making in face-to-face teaching. Paper presented at the Semiotic Margins Conference, University of Sydney.
Google Scholar
Hood, S.E. 2010. Mimicking and mocking identities: the roles of language and body language in Taylor Mali’s “Speak with conviction”. Invited seminar at the Hong Kong Polytechnic University, 4 November 2010.
Google Scholar
Hood, S.E. 2011. Body language in face-to-face teaching: a focus on textual and interpersonal meaning. In Semiotic margins: meanings in multimodalities, ed. S. Dreyfus, S. Hood, and S. Stenglin, 31–52. London: Continuum.
Google Scholar
Hopper, R., S. Koch, and J. Mandelbaum. 1986. Conversation analysis methods. In Contemporary issues in language and discourse processes, ed. D.G. Ellis, and W.A. Donohue, 169–186. Hilldale: Lawrence Erlbaum Associates.
Google Scholar
Hornik, J. 1987. The effect of touch and gaze upon compliance and interest of interviewees. The Journal of Social Psychology 127: 681–683.
Google Scholar
House, E.T. 1980. Evaluating with validity. Beverly Hills: Sage Publications.
Google Scholar
Hu, L.T., and P.M. Bentler. 1999. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Structural Equation Modelling: A Multidisciplinary Journal 6: 1–55.
Article Google Scholar
Hu, Z., and J. Dong. 2006. How meaning is construed multimodally: a case study of a PowerPoint presentation contest. Computer Assisted Foreign Language Education 3: 3–12.
Google Scholar
Huerta-Macias, A. 1995. Alternative assessment: responses to commonly asked questions. TESOL Journal 5(1): 8–11.
Google Scholar
Hughes, A. 2003. Testing for language teachers, 2nd ed. Cambridge: Cambridge University Press.
Google Scholar
Hulstijn, J.H. 2007. The shaky ground beneath the CEFR: quantitative and qualitative dimensions of language proficiency. The Modern Language Journal 91(4): 663–667.
Article Google Scholar
Hulstijn, J.H. 2011. Language proficiency in native and nonnative speakers: an agenda for research and suggestions for second-language assessment. Language Assessment Quarterly 8(3): 229–249.
Article Google Scholar
Hymes, D.H. 1962. The ethnography of speaking. In Anthropology and human behaviour, ed. T. Gladwin, and W.C. Sturtevant, 13–53. Washington: The Anthropology Society of Washington.
Google Scholar
Hymes, D.H. 1964. Introduction: toward ethnographies of communication. American Anthropologist 6(6): 1–34.
Article Google Scholar
Hymes, D.H. 1972. On communicative competence. In Sociolinguistics, ed. J. Pride, and J. Holmes, 269–293. Harmondsworth: Penguin.
Google Scholar
Hymes, D.H. 1973. Toward linguistic competence. Texas working papers in sociolinguistics (working paper No. 16). Austin, Tx: Centre for Intercultural Studies in Communication, and Department of Anthropology, University of Texas.
Google Scholar
Hymes, D.H. 1974. Foundations in sociolinguistics: an ethnographic approach. Philadelphia: University of Pennsylvania Press.
Google Scholar
Hymes, D.H. 1982. Toward linguistic competence. Philadelphia: Graduate School of Education, University of Pennsylvania.
Google Scholar
Iedema, R. 2001. Analysing film and television: a social semiotic account of hospital: an unhealthy business. In Handbook of visual analysis, ed. T. van Leeuwen, and C. Jewitt, 183–204. London: Sage.
Google Scholar
Iizuka, Y. 1992. Extraversion, introversion and visual interaction. Perceptual and Motor Skills 74: 43–59.
Article Google Scholar
Ingram, D., and E. Wylie. 1993. Assessing speaking proficiency in the international English language testing system. In A new decade of language testing research: selected papers from the 1990s language testing research colloquium, ed. D. Douglas, and C. Chapelle, 220–234. Alexandria: TESOL Inc.
Google Scholar
Jacobs, E. 1988. Clarifying qualitative research: A focus on traditions. Educational Researcher, 17(1):16–24.
Google Scholar
Jackendoff, R. 1983. Semantics and cognition. Cambridge: MIT Press.
Google Scholar
Janik, S.W., A.R. Wellens, M.L. Goldberg, and L.F. Dell’Osso. 1978. Eyes as the centre of focus in the visual examination of human faces. Perceptual and Motor Skills 47: 857–858.
Article Google Scholar
Jarvis, G.A. 1986. Proficiency testing: a matter of false hopes? ADFL Bulletin 18: 20–21.
Article Google Scholar
Jewitt, C. 2002. The move from page to screen: the multimodal reshaping of school English. Journal of Visual Communication 1(2): 171–196.
Article Google Scholar
Jewitt, C. 2006. Technology, literacy and learning: a multimodal approach. London: Routledge.
Google Scholar
Jewitt, C. 2009. An introduction to multimodality. In The Routledge handbook of multimodal analysis, ed. C. Jewitt, 14–27. London: Routledge.
Google Scholar
Jewitt, C. 2011. The changing pedagogic landscape of subject English in UK classrooms. In Multimodal studies: exploring issues and domains, ed. K.L. O’Halloran, and B.A. Smith, 184–201. London: Routledge.
Google Scholar
Johnson, K., and H. Johnson. 1999. Encyclopaedic dictionary of applied linguistics: a handbook for language teaching. Malden: Blackwell Publishers Inc.
Book Google Scholar
Johnson, M., and A. Tylor. 1998. Re-analysing the OPI: how much does it look like natural conversation? In Talking and testing: discourse approaches to the assessment of oral proficiency, ed. R. Young, and W. He, 27–51. Philadelphia: John Benjamins.
Chapter Google Scholar
Jöreskog, K.G. 1993. Testing structural equation models. In Testing structural equation models, ed. D. Bollen, and J.S. Long, 294–316. Newbury Park: Sage Publications.
Google Scholar
Jungheim, N.O. 1995. Assessing the unsaid: the development of tests of nonverbal ability. In Language testing in Japan, ed. J.D. Brown, and S.O. Yamashita, 149–165. Tokyo: JALT.
Google Scholar
Jungheim, N.O. 2001. The unspoken element of communicative competence: evaluating language learners’ nonverbal behaviour. In A focus on language test development: expanding the language proficiency construct across a variety of tests, ed. T. Hudson, and J.D. Brown, 1–34. Honolulu: University of Hawaii, Second Language Teaching and Curriculum Centre.
Google Scholar
Kaindl, L. 2005. Multimodality in the translation of humour in comics. In Perspectives on multimodality, ed. E. Ventola, C. Charles, and M. Kaltenbacher, 173–192. Amsterdam: John Benjamins.
Google Scholar
Kalma, A. 1992. Gazing in triads: a powerful signal in floor apportionment. British Journal of Social Psychology 31: 21–39.
Article Google Scholar
Kane, T. M. 1990. An argument-based approach to validation. Iowa: The American College Testing Program.
Google Scholar
Kane, M.T. 1992. An argument-based approach to validity. Psychological Bulletin 112(3): 527–535.
Article Google Scholar
Kane, M.T. 1994. Validating interpretative arguments for licensure and certification examinations. Evaluation and the Health Professions 17(2): 133–159.
Article Google Scholar
Kane, M.T. 2001. Current concerns in validity theory. Journal of Educational Measurement 38(4): 319–342.
Article Google Scholar
Kane, M.T. 2002. Validating high-stakes testing programs. Educational Measurement: Issues and Practice 21(1): 31–41.
Article Google Scholar
Kane, M.T. 2004. Certification testing as an illustration of argument-based validation. Measurement: Interdisciplinary Research and Perspectives, 2(3), 135–170.
Google Scholar
Kane, M.T. 2006. Validation. In Educational measurement, 4th ed, ed. R. Brennan, 17–64. Westport: American Council on Education and Praeger.
Google Scholar
Kane, M.T. 2010. Validity and fairness. Language Testing 27(2): 177–182.
Article Google Scholar
Kane, M.T., T. Crooks, and A. Cohen. 1999. Validating measures of performance. Educational Measurement: Issues and Practice 18(2): 5–17.
Article Google Scholar
Kasper, G., and K.R. Rose. 2002. Pragmatic development in a second language. Oxford: Blackwell.
Google Scholar
Kendon, A. 1967. Some functions of gaze-direction in social interaction. Acta Psychologica 26: 22–63.
Article Google Scholar
Kendon, A. 1980. Gesticulation and speech: Two aspects of the process of utterance. In The relationship of verbal and nonverbal communication, ed. M.R. Key, 207–227. The Hague: Mouton and Co.
Google Scholar
Kendon, A. 1981. The organization of behavior in face-to-face interaction: observations on the development of a methodology. In Handbook of research methods in nonverbal behavior, ed. P. Ekman, and K. Scherer, 440–505. Cambridge: Cambridge University Press.
Google Scholar
Kendon, A. 1985. Some uses of gesture. In Perspectives on silence, ed. D. Tannen, and M. Saville-Troike, 215–234. Norwood: Ablex.
Google Scholar
Kendon, A. 1996. Gesture in language acquisition. Multilingual 15: 201–214.
Article Google Scholar
Kendon, A. 2004. Gesture: visible action as utterance. Cambridge: Cambridge University Press.
Book Google Scholar
Kim, M. 2001. Detecting DIF across the different language groups in a speaking test. Language Testing 18(1): 89–114.
Article Google Scholar
Kim, Y. 2009. An investigation into native and non-native teachers’ judgments of oral English performance: a mixed methods approach. Language Testing 26(2): 187–217.
Article Google Scholar
Kleinke, C.L. 1986. Gaze and eye contact: a research review. Psychological Bulletin 100(1): 78–100.
Article Google Scholar
Knoch, U. 2009. Diagnostic writing assessment: the development and validation of a rating scale. Frankfurt: Peter Lang.
Google Scholar
Knox, J.S. 2008. Online newspapers and TESOL classrooms: a multimodal perspective. In Multimodal semiotics: functional analysis in contexts of education, ed. L. Unsworth, 139–158. London: Continuum.
Google Scholar
Kok, A.K.C. 2004. Multisemiotic mediation in hypertext. In Multimodal discourse analysis: systemic-functional perspectives, ed. K.L. O’Halloran, 131–159. London: Continuum.
Google Scholar
Kondo-Brown, K. 2002. A FACETS analysis of rater bias in measuring Japanese second language writing performance. Language Testing 19(1): 3–31.
Article Google Scholar
Kormos, J. 1999. Simulating conversations in oral-proficiency assessments: a conversation analysis of role plays and non-scripted interviews in language exams. Language Testing 16(2): 163–188.
Google Scholar
Kress, G. 2000. Design and transformation: new theories of meaning. In Multiliteracies: literacy learning and the design of social futures, ed. B. Cope, and M. Kalantzis, 153–161. South Yarra: Macmillan Publishers Australia Pte Ltd.
Google Scholar
Kress, G., et al. 2001. Multimodal teaching and learning: the rhetorics of the science classroom. London: Continuum.
Google Scholar
Kress, G., and T. van Leeuwen. 1996. Reading images: the grammar of visual design. London: Routledge.
Google Scholar
Kress, G., and T. van Leeuwen. 1998. The (critical) analysis of newspaper layout. In Approaches to media discourse, ed. A. Bell, and P. Garrett, 186–219. Oxford: Blackwell.
Google Scholar
Kress, G., and T. van Leeuwen. 2001. Multimodal discourse: the modes and media of contemporary communication. London: Edward Arnold.
Google Scholar
Kress, G., and T. van Leeuwen. 2002. Colour as a semiotic mode: notes for a grammar of colour. Visual Communication 3: 343–368.
Article Google Scholar
Kress, G., and T. van Leeuwen. 2006. Reading images: the grammar of visual design, 2nd ed. London: Routledge.
Google Scholar
Kress, G., et al. 2005. English in urban classrooms: a multimodal perspective on teaching and learning. London: Routledge.
Book Google Scholar
Kunnan, A.J. 1995. Test taker characteristics and test performance: a structural modelling approach. Cambridge: Cambridge University Press.
Google Scholar
Kunnan, A.J. (ed.). 2000. Fairness and validation in language assessment. Cambridge: Cambridge University Press.
Google Scholar
Kunnan, A.J. 2004. Test fairness. In European language testing in a global context, ed. M. Milanovic, and C.J. Weir, 27–48. Cambridge: Cambridge University Press.
Google Scholar
Kunnan, A.J. 2005. Language assessment from a wider context. In Handbook of research in second language learning, ed. E. Hinkel, 779–794. Mahwah: Lawrence Erlbaum Associates.
Google Scholar
Kunnan, A.J. 2008. Towards a model of test evaluation: using the test fairness and wider context frameworks. In Multilingualism and assessment: achieving transparency, assuring quality, sustaining diversity. Papers from the ALTE Conference in Berlin, Germany, ed. L. Taylor, and C.J. Weir, 229–251. Cambridge: Cambridge University Press.
Google Scholar
Kunnan, A.J. 2010. Fairness matters and Toulmin’s argument structures. Language Testing 24(2): 183–189.
Article Google Scholar
Lado, R. 1961. Language testing. New York: McGraw-Hill.
Google Scholar
Langenfeld, T.E., and L.M. Crocker. 1994. The evolution of validity theory: publish school testing, the courts, and incompatible interpretations. Educational Assessment 2(2): 149–165.
Article Google Scholar
Lantolf, J., and W. Frawley. 1985. Oral proficiency testing: a critical analysis. The Modern Language Journal 69(3): 337–345.
Article Google Scholar
Lantolf, J., and W. Frawley. 1988. Proficiency, understanding the construct. Studies in Second Language Acquisition 10(2): 181–196.
Article Google Scholar
Larsen-Freeman, D. (ed.). 1980. Discourse analysis in second language research. Rowley: Newbury House.
Google Scholar
Lazaraton, A. 1991. A conversation analysis of structure and interaction in the language interview. Unpublished Ph.D. thesis, University of California at Los Angeles, USA.
Google Scholar
Lazaraton, A. 1992. The structural organisation of a language interview: a conversational analytic perspective. System 20(3): 373–386.
Article Google Scholar
Lazaraton, A. 1995. Qualitative research in TESOL: a progress report. TESOL Quarterly 29: 455–472.
Article Google Scholar
Lazaraton, A. 1996a. Interlocutor support in oral proficiency interviews: the case of CASE. Language Testing 13(2): 151–172.
Article Google Scholar
Lazaraton, A. 1996b. A qualitative approach to monitoring examiner conduct in CASE. In Studies in language testing 3: performance testing, cognition, and assessment: selected papers from the 15th Language Testing Research Colloquium, Cambridge and Arnhem, ed. M. Milanovic, and N. Saville, 18–33. Cambridge: Cambridge University Press.
Google Scholar
Lazaraton, A. 2002. A qualitative approach to the validation of oral language tests. Cambridge: Cambridge University Press.
Google Scholar
Lazaraton, A. 2008. Utilising qualitative methods for assessment. In Encyclopaedia of language and education, 2nd edn. Vol. 7: Language Testing and Assessment, pp. 197–209. New York: Springer.
Google Scholar
Leathers, D.G., and H.M. Eaves. 2008. Successful nonverbal communication: principles and applications, 4th ed. New York: Pearson Education Inc.
Google Scholar
Lemke, J.L. 2002. Travels in hypermodality. Visual Communication 1(3): 299–325.
Article Google Scholar
Lennon, P. 1990. Investigating fluency in EFL: a quantitative approach. Language Learning 40(3): 387–417.
Article Google Scholar
Leung, C. 2005a. Convival communication: recontextualising communicative competence. International Journal of Applied Linguistics 15(2): 119–143.
Article Google Scholar
Leung, C. 2005b. Classroom teacher assessment of second language development: construct as practice. In Handbook of research in second language teaching and learning, ed. E. Hinkel, 869–888. Mahwah: Lawrence Erlbaum Associates.
Google Scholar
Leung, C., and B. Mohan. 2004. Teacher formative assessment and talk in classroom contexts: assessment as discourse and assessment of discourse. Language Testing 21(3): 335–359.
Article Google Scholar
Levine, P., and R. Scollon (eds.). 2004. Discourse and technology: multimodal discourse analysis. Washington: Georgetown University Press.
Google Scholar
Levinson, S.C. 1983. Pragmatics. Cambridge: Cambridge University Press.
Google Scholar
Linn, R.L. 1994. Performance assessment: policy promises and technical measurement standards. Educational Researcher 23(9): 4–14.
Article Google Scholar
Linn, R.L. 1997. Evaluating the validity of assessments: the consequences of use. Educational Measurement: Issues and Practice 16(2): 14–16.
Article Google Scholar
Liski, E., and S. Puntanen. 1983. A study of the statistical foundations of group conversation tests in spoken English. Language Learning 33(2): 225–246.
Article Google Scholar
Little, D. 2006. The Common European Framework of Reference for Languages: content, purpose, origin, reception and impact. Language Teaching 39(3): 167–190.
Article Google Scholar
Llosa, L. 2007. Validating a standards-based classroom assessment of English proficiency: a multi-trait multi-method approach. Language Testing 24(4): 489–515.
Article Google Scholar
Lloyd-Jones, R. 1977. Primary trait scoring. In Evaluating writing: describing, measuring, judging, ed. C.R. Cooper, and L. Odell, 33–66. Urbana: National Council of Teachers of English.
Google Scholar
Long, Y., and P. Zhao. 2009. The interaction study between multimodality and metacognitive strategy in college English listening comprehension teaching. Computer Assisted Foreign Language Education 4: 58–74.
Google Scholar
Lowe, P. 1985. The ILR proficiency scale as a synthesising research principle: the view from the mountain. In Foreign language proficiency in the classroom and beyond, ed. C.J. James, 9–54. Lincolnwood: National Textbook Company.
Google Scholar
Lumley, T. 2002. Assessment criteria in a large-scale writing test: what do they really mean to the raters? Language Testing 19: 246–276.
Article Google Scholar
Lumley, T. 2005. Assessing second language writing: the rater’s perspective. New York: Peter Lang.
Google Scholar
Lumley, T., and A. Brown. 2005. Research methods in language testing. In Handbook of research in second language teaching and learning, ed. E. Hinkel, 855–933. Mahwah: Lawrence Erlbaum Associates.
Google Scholar
Lumley, T., and B. O’Sullivan. 2005. The effect of test-taker gender, audience and topic on task performance in tape-mediated assessment of speaking. Language Testing 22(4): 415–437.
Article Google Scholar
Luoma, S. 2004. Assessing speaking. Cambridge: Cambridge University Press.
Book Google Scholar
Lynch, B. 2001. Rethinking assessment from a critical perspective. Language Testing 18(4): 333–349.
Article Google Scholar
Lynch, B. 2003. Language assessment and programme evaluation. New Haven: Yale.
Google Scholar
Macken-Horarik, M. 2004. Interacting with the multimodal text: reflections on image and verbiage in ArtExpress. Visual Communication 3(1): 5–26.
Article Google Scholar
Macken-Horarik, M., L. Love, and L. Unsworth. 2011. A grammatics ‘good enough’ for school English in the 21st century: four challenges in realising the potential. Australian Journal of Language and Literacy 34(1): 9–23.
Google Scholar
Maiorani, A. 2009. The Matrix phenomenon. A linguistic and multimodal analysis. Saarbrucken: VDM Verlag.
Google Scholar
Marsh, H.W. 1988. Multi-trait multi-method analyses. In Educational research methodology, and evaluation: an international handbook, ed. J.P. Keeves, 570–578. Oxford: Pergamon.
Google Scholar
Marsh, H.W. 1989. Confirmatory factor analysis of multi-trait multi-method data: many problems and a few solutions. Applied Psychological Measurement 15: 47–70.
Article Google Scholar
Martin, J.R. 1995. Interpersonal meaning, persuasion and public discourse: Packing semiotic punch. Australian Journal of Linguistics, 15(1):33–67.
Google Scholar
Martin, J.R. 2000. Beyond exchange: Appraisal systems in English. In Evaluation in text: Authorial stance and the construction of discourse, eds. S. Hunston, and G. Thompson 142–175. Oxford: Oxford University Press.
Google Scholar
Martin, J.R. 2008. Intermodal reconciliation: mates in arms. In New literacies and the English curriculum, ed. L. Unsworth, 112–148. London: Continuum.
Google Scholar
Martin, J.R. and P.R.R., White. 2005. The language of evaluation: Appraisal in English. London: Palgrave.
Google Scholar
Martinec, R. 2000a. Types of processes in action. Semiotica 130(3): 243–268.
Google Scholar
Martinec, R. 2000b. Construction of identity in Michael Jackson’s “Jam”. Social Semiotics 10(3): 313–329.
Article Google Scholar
Martinec, R. 2001. Interpersonal resources in action. Semiotica 135(1): 117–145.
Google Scholar
Martinec, R. 2004. Gestures that co-occur with speech as a systematic resource: the realisation of experiential meanings in indexes. Social Semiotics 14(2): 193–213.
Article Google Scholar
Matsumoto, D. 2006. Culture and cultural worldviews: Do verbal descriptions about culture reflect anything other than verbal descriptions of culture? Culture and Psychology, 12(1):33–62.
Google Scholar
Matsuno, S. 2009. Self-, peer- and teacher-assessments in Japanese university EFL writing classrooms. Language Testing 26(1): 75–100.
Article Google Scholar
Matthews, M. 1990. The measurement of productive skills: doubts concerning the assessment criteria of certain public examinations. English Language Teaching Journal 44(2): 117–121.
Article Google Scholar
Matthiessen, C.M.I.M. 2007. The multimodal page: a systemic functional exploration. In New directions in the analysis of multimodal discourse, ed. T.D. Royce, and W.L. Bowcher, 1–62. Mahwah: Lawrence Erlbaum Associates.
Google Scholar
Maynard, S.K. 1987. Interactional functions of a nonverbal sign: head movement in Japanese dyadic casual conversation. Journal of Pragmatics 11: 589–606.
Article Google Scholar
Maynard, S.K. 1989. Japanese conversation: self-contextualisation through structure and interactional management. Norwood: Albex.
Google Scholar
Maynard, S.K. 1990. Understanding interactive competence in L1/L2 contrastive context: a case of backchannel behaviour in Japanese and English. In Language proficiency: defining, teaching, and testing, ed. L.A. Arena, 41–52. New York: Plenum Press.
Chapter Google Scholar
McCrimman, J.M. 1984. Writing with a purpose, 8th ed. Boston: Houghton Mifflin.
Google Scholar
McKay, P. 1995. Developing ESL proficiency descriptions for the school context: the NLLIA ESL band scales. In Language assessment in action, ed. G. Brindley, 3–34. Sydney: National Centre for English Language Teaching and Research.
Google Scholar
McNamara, T. 1990. Item response theory and the validation of an ESP test for health professionals. Language Testing 7(1): 52–76.
Article Google Scholar
McNamara, T. 1996. Measuring second language performance. London: Longman.
Google Scholar
McNamara, T. 2000. Language testing. Oxford: Oxford University Press.
Google Scholar
McNamara, T. 2001. Language assessment as social practice: challenges for research. Language Testing 18(4): 333–349.
Article Google Scholar
McNamara, T., and C. Roever. 2006. Language testing: the social dimension. Oxford: Blackwell Publishing.
Google Scholar
McNeill, D. 1979. The conceptual basis of language. Hilldale: Lawrence Erlbaum Associates.
Google Scholar
McNeill, D. 1992. Hand and mind: what gestures reveal about thought. Chicago: The University of Chicago Press.
Google Scholar
McNeill, D. 1998. Speech and gesture integration. In The nature and functions of gesture in children's communication. New directions for child development, eds. J.M. Iverson, and S. Goldin-Meadow, 11–27. San Francisco: Jossey-Bass Inc, Publishers.
Google Scholar
McNeill, D. (ed.). 2000. Language and gesture. Cambridge: Cambridge University Press.
Google Scholar
McNeill, D. 2005. Gesture and thought. Chicago: The University of Chicago Press.
Book Google Scholar
Mehrens, W.A. 1997. The consequences of consequential validity. Educational Measurement: Issues and Practice 16(2): 16–18.
Article Google Scholar
Messick, S. 1975. The standard problem: meaning and values in measurement and evaluation. American Psychologist 30(10): 955–966.
Article Google Scholar
Messick, S. 1980. Test validity and the ethics of assessment. American Psychologist 35(11): 1012–1027.
Article Google Scholar
Messick, S. 1988. The once and future issues of validity: assessing the meaning and consequences of measurement. In Test validity, eds. H. Wainer, and H.I. Braun, 33–45. Hillsdale: Lawrence Erlbaum Associates.
Google Scholar
Messick, S. 1989a. Meaning and value in test validation: the science and ethics of assessment. Educational Researcher 18(2): 5–11.
Article Google Scholar
Messick, S. 1989b. Validity. In Educational measurement, 3rd ed, ed. R.L. Linn, 13–103. New York: American Council on Education & Macmillan Publishing Company.
Google Scholar
Messick, S. 1992. Validity of test interpretation and use. In Encyclopaedia of educational research, 6th ed, ed. M.C. Alkin, 1487–1495. New York: Macmillan.
Google Scholar
Messick, S. 1994. The interplay of evidence and consequences in the validation of performance assessment. Educational Research 2(2): 13–23.
Article Google Scholar
Messick, S. 1995. Standards of validity and the validity of standards in performance assessment. Educational Measurement: Issues and Practice 14(4): 5–8.
Article Google Scholar
Messick, S. 1996. Validity and washback in language testing. Language Testing 13(3): 241–256.
Article Google Scholar
Mickan, P. 2003. What’s your score? An investigation into language descriptors for rating written performance. Canberra: IELTS Australia.
Google Scholar
Milanovic, M., N. Saville, A. Pollitt, and A. Cook. 1996. Developing and validating rating scales for CASE: theoretical concerns and analyses. In Validation in language testing, ed. A. Cumming, and R. Berwick, 15–38. Philadelphia: Multilingual Matters Ltd.
Google Scholar
Mislevy, R.J. 2003. Substance and structure in assessment arguments. Law, Probability, and Risk 2(4): 237–258.
Article Google Scholar
Mislevy, R.J., L.S. Steinberg, and R.G. Almond. 2003. On the structure of educational assessments. Measurement: Interdisciplinary Research and Perspectives 1(1):3–67.
Google Scholar
Mislevy, R.J., R.G. Almond, and L.S. Steinberg. 2002. On the roles of task model variables in assessment design. In Generating items for cognitive tests: theory and practice, ed. S. Irvine, and P. Kyllonen, 97–128. Hillsdale: Lawrence Erlbaum Associates.
Google Scholar
Morrow, K. (ed.). 2004. Insights from the Common European Framework. Oxford: Oxford University Press.
Google Scholar
Mosier, C.I. 1947. A critical examination of the concepts of face validity. Educational and Psychological Measurement 7(2): 191–205.
Article Google Scholar
Moss, P.A. 1992. Shifting conceptions of validity in educational measurement: implications for performance assessment. Review of Educational Research 62(3): 229–258.
Article Google Scholar
Munby, J. 1978. Communicative syllabus design. Cambridge: Cambridge University Press.
Google Scholar
Myford, C.M. 2002. Investigating design features of descriptive graphic rating scales. Applied Measurement in Education 15(2): 187–215.
Article Google Scholar
Nakatsuhara, F. 2009. Conversational styles in group oral tests: how is the conversation co-constructed? Unpublished Ph.D. thesis, The University of Essex, UK.
Google Scholar
Nambiar, M.K., and C. Goon. 1993. Assessment of oral skills: a comparison of scores obtained through audio recordings to those obtained through face-to-face evaluation. RELC Journal 24(1): 15–31.
Article Google Scholar
Neu, J. 1990. Assessing the role of nonverbal communication in the acquisition of communicative competence in L2. In Developing communicative competence in a second language: series on issues in second language research, ed. C.R. Scarcella, S.E. Andersen, and D.S. Krashen, 121–138. New York: Newbury House Publishers.
Google Scholar
Nevo, D., and E. Shohamy. 1984. Applying the joint committee’s evaluation standards for the assessment of alternative testing methods. Paper presented at the annual meeting of the American Educational Research Association, New Orleans.
Google Scholar
Nevo, B. 1985. Face validity revisited. Journal of Educational Measurement 22(4): 287–293.
Article Google Scholar
Norris, S. 2002. Theoretical framework for multimodal discourse analysis presented via the analysis of identity construction of two women living in Germany. Unpublished Ph.D. thesis, Georgetown University, USA.
Google Scholar
Norris, S. 2004. Analysing multimodal interaction: a methodological framework. London: Routledge.
Google Scholar
Norris, J.M. 2005. Book review: common European Framework of Reference for Languages: learning, teaching, assessment. Language Testing 22(3): 399–405.
Article Google Scholar
Norris, S., and R.H. Jones (eds.). 2005. Discourse in action: introducing mediated discourse analysis. London: Routledge.
Google Scholar
North, B. 1994. Scales of language proficiency: a survey of some existing systems. Washington, DC: Georgetown University Press.
Google Scholar
North, B. 1996. The development of a common framework scale of descriptors of language proficiency based on a theory of measurement. Unpublished Ph.D. thesis, Thames Valley University, UK.
Google Scholar
North, B. 2000. The development of a common framework scale of language proficiency. New York: Peter Lang Publishing Inc.
Google Scholar
North, B. 2003. Scales for rating language performance: descriptive models, formulation styles, and presentation formats. TOEFL Monograph, No. TOEFL-MS-24. Princeton: Educational Testing Service.
Google Scholar
North, B. 2010a. Levels and goals: central frameworks and local strategies. In The handbook of educational linguistics, ed. B. Spolsky, and F.M. Hult, 220–230. Malden: Wiley-Blackwell.
Google Scholar
North, B. 2010b. Assessment, certification and the CEFR: an overview. Plenary speech at IATEFL TEA SIG & EALTA conference, Barcelona, Spain.
Google Scholar
North, B., and G. Schneider. 1998. Scaling descriptors for language proficiency scales. Language Testing 15(2): 217–262.
Article Google Scholar
O’Halloran, K.L. 2000. Classroom discourse in mathematics: a multisemiotic analysis. Linguistics and Education 10(3): 359–388.
Article Google Scholar
O’Halloran, K.L. 2004. Visual semiosis in film. In Multimodal discourse analysis: systemic-functional perspectives, ed. K.L. O’Halloran, 109–130. London: Continuum.
Google Scholar
O’Halloran, K.L. 2005. Mathematical discourse: language, symbolism and visual images. London: Continuum.
Google Scholar
O’Halloran, K.L. 2008a. Inter-semiotic expansion of experiential meaning: hierarchical scales and metaphor in mathematics discourse. In New developments in the study of ideational meaning: from language to multimodality, ed. C. Jones, and E. Ventola, 231–254. London: Equinox.
Google Scholar
O’Halloran, K.L. 2008b. Systemic functional-multimodal discourse analysis (SF-MDA): constructing ideational meaning using language and visual imagery. Visual Communication 7(4): 443–475.
Article Google Scholar
O'Halloran, K. 2009. Historical changes in the Semiotic landscape: From calculation to computation. In The routledge handbook of multimodal analysis, ed. C. Jewitt, 98–113. UK: Routledge.
Google Scholar
O’Halloran, K.L. 2011. Multimodal discourse analysis. In Continuum companion to discourse analysis, ed. K. Hyland, and B. Paltridge, 120–137. London: Continuum.
Google Scholar
O’Halloran, K.L., and F.V. Lim. 2009. Sequential visual discourse frames. In The world told and the world shown: multisemiotic issues, ed. E. Ventola, and A.J.M. Guijarro, 139–156. Hampshire: Palgrave Macmillan.
Google Scholar
O’Loughlin, K.K. 2002. The impact of gender in oral proficiency testing. Language Testing 19(2): 169–192.
Article Google Scholar
O’Malley, J.M., and A.U. Chamot. 1990. Learning strategies in second language acquisition. Cambridge: Cambridge University Press.
Book Google Scholar
O’Toole, M. 1994. The language of displayed art. London: Leicester University Press.
Google Scholar
O’Toole, M. 2010. The language of displayed art, 2nd ed. London: Routledge.
Google Scholar
O’Toole, M. 2011. Art vs. computer animation: integrity and technology in “South Park”. In Multimodal studies: exploring issues and domains, ed. K.L. O’Halloran, and B.A. Smith, 239–252. London: Routledge.
Google Scholar
Ockey, G.J. 2001. Is the oral interview superior to the group oral? Working paper on language acquisition and education, International University of Japan, vol. 11, pp. 22–41.
Google Scholar
Oller, J.W. 1979. Language tests at school. London: Longman.
Google Scholar
Oller, J.W. 1983. Evidence for a general language proficiency factor: an expectancy grammar. In Issues in language testing research, ed. J.W. Oller, 3–10. Rowley: Newbury House.
Google Scholar
Oller, J.W., and F.B. Hinofotis. 1980. Two mutually exclusive hypotheses about second language ability: indivisible or partially divisible competence. In Research in language testing, ed. J.W. Oller, and K. Perkins, 13–23. Rowley: Newbury House.
Google Scholar
Oreström, B. 1983. Turn-taking in English conversation. Lund Studies in English 66, CWK Gleerup.
Google Scholar
Painter, C. 2007. Children’s picture book narratives: reading sequences of images. In Advances in language and education, ed. A. McCabe, M. O’Donnell, and R. Whittaker, 40–59. London: Continuum.
Google Scholar
Painter, C. 2008. The role of colour in children’s picture books. In New literacies and the English curriculum, ed. L. Unsworth, 89–111. London: Continuum.
Google Scholar
Painter, C., J.R. Martin, and L. Unsworth. 2013. Reading visual narratives: Image analysis of children’s picture books. Bristol: Equinox Publishing.
Google Scholar
Patri, M. 2002. The influence of peer feedback on self- and peer-assessment. Language Testing 19(2): 109–132.
Article Google Scholar
Pawley, A., and F.H. Syder. 1983. Two puzzles for linguistic theory: nativelike selection and nativelike fluency. In Language and communication, ed. J.C. Richards, and R.W. Schmidt, 191–225. London: Longman.
Google Scholar
Pienemann, M., and M. Johnston. 1987. Factors influencing the development of language proficiency. In Applying second language acquisition research, ed. D. Nunan, 89–94. Adelaide: National Curriculum Resource Centre.
Google Scholar
Pike, K.L. 1967. Language in relation to a unified theory of the structure of human behaviour, 2nd ed. The Hague: Mouton & Co.
Book Google Scholar
Poggi, I. 2001. The lexicon of the conductor’s face. In Language, vision and music, ed. P. McKevitt, S. Nuallsin, and C. Mulvihill, 271–284. Amsterdam: John Benjamins.
Google Scholar
Pollitt, A., and C. Hutchinson. 1987. Calibrating graded assessment: Rasch partial credit analysis of performance in writing. Language Testing 4(1): 72–92.
Article Google Scholar
Pomerantz, A., and B.J. Fehr. 1997. Conversation analysis: An approach to the study of social action as sense making practices. In Discourse as social action, discourse studies: a multidisciplinary introduction, vol. 2, ed. T.A. van Dijk, 64–91. London: Sage Publications.
Google Scholar
Popham, W.J. 1990. Modern educational measurement: a practitioner’s perspective. New York: Prentice Hall.
Google Scholar
Popham, W.J. 1997. Consequential validity: right concern—wrong concept. Educational Measurement: Issues and Practice 16(2): 9–13.
Article Google Scholar
Popham, W.J. 2008. Transformative assessment. Alexandria: Association for Supervision and Curriculum Development.
Google Scholar
Psathas, G. 1995. Conversation analysis: the study of talk-in-interaction. Thousand Oaks: Sage.
Google Scholar
Purpura, J. 1999. Learner strategy use and performance on language tests: a structural equation modelling approach. Cambridge: Cambridge University Press.
Google Scholar
Purpura, J. 2004. Assessing grammar. Cambridge: Cambridge University Press.
Book Google Scholar
Purpura, J. 2008. Assessing communicative language ability. In Encyclopaedia of language and education, eds. E. Shohamy, and N.H. Hornberger, 2nd edn. Vol. 7: language testing and assessment, pp. 53–68. New York: Springer.
Google Scholar
Ravelli, L.J. 2000. Beyond shopping: constructing the Sydney Olympics in three-dimensional text. Text 20(4): 489–515.
Google Scholar
Raykov, T., and G.A. Marcoulides. 2006. A first course in structural equation modeling, 2nd ed. Mahwah: Lawrence Erlbaum Associates, Inc.
Google Scholar
Rea-Dickens, P. 2006. Currents and eddies in the discourse of assessment: a learning-focused interpretation. International Journal of Applied Linguistics 16(2): 163–188.
Article Google Scholar
Richards, J.C., and R.W. Schmidt. 1983. Conversation analysis. In Language and communication, ed. J.C. Richards, and R.W. Schmidt, 117–153. London: Longman.
Google Scholar
Richards, J.C., et al. 1992. Longman dictionary of language teaching and applied linguistics. London: Longman.
Google Scholar
Riley, P. 1996. Developmental sociolinguistics and the competence/performance distinction. In Performance and competence in second language acquisition, ed. G. Brown, K. Malinkjaer, and J. Williams, 114–135. Cambridge: Cambridge University Press.
Google Scholar
Ross, S.J. 1998. Self-assessment in second language testing: a meta-analysis and analysis of experiential factors. Language Testing 15(1): 1–20.
Google Scholar
Ross, S.J. 2005. The impact of assessment method on foreign language proficiency growth. Applied Linguistics 26(3): 317–342.
Article Google Scholar
Ross, S.J., and R. Berwick. 1992. The discourse of accommodation in oral proficiency interviews. Studies in Second Language Acquisition 14(2): 159–176.
Article Google Scholar
Royce, T. 2007. Multimodal communicative competence in second language contexts. In New directions in the analysis of multimodal discourse, ed. T. Royce, and W. Bowcher, 361–390. New York: Routledge.
Google Scholar
Ruesch, J., and W. Kees. 1956. Nonverbal communication: notes on the visual perception of human relations. Berkeley: University of California Press.
Google Scholar
Sacks, H. 1992. Lectures on conversation, vol. 1&2. Cambridge: Blackwell.
Google Scholar
Sacks, H., E.A. Schegloff, and G. Jefferson. 1974. A simplest systematic for the organisation of turn-taking in conversation. Language 50: 696–735.
Article Google Scholar
Sadler, D.R. 1989. Formative assessment and the design of instructional systems. Instructional Science 18(2): 119–144.
Article Google Scholar
Saitz, R., and E.J. Cervenka. 1972. Handbook of gestures. Mouton: The Hague.
Google Scholar
Sajavaara, K. 1987. Second language speech production: factors affecting fluency. In Psycholinguistic models of production, ed. H.D. Dechert, and M. Raupach, 45–65. Norwood: Ablex.
Google Scholar
Sasaki, M. 1993. Relationships among second language proficiency, foreign language aptitude and intelligence: a structural equation modelling approach. Language Learning 43: 313–344.
Article Google Scholar
Savignon, S.J. 1983. Communicative competence: theory and classroom practice; texts and contexts in second language learning. Reading: Addison-Wesley.
Google Scholar
Savignon, S.J. 1997. Communicative competence: theories and classroom practice. New York: McGraw-Hill.
Google Scholar
Sawaki, Y. 2007. Construct validation of analytic rating scales in a speaking assessment: reporting a score profile and a composite. Language Testing 24(3): 355–390.
Article Google Scholar
Schiffrin, D. 1994. Approaches to discourse. Oxford: Basil Blackwell.
Google Scholar
Schlenker, B.R. 1980. Impression management: the self-concept, social identity, and interpersonal relations. Monterey: Brooks/Cole.
Google Scholar
Schmidt, R. 1992. Psychological mechanisms underlying second language fluency. Studies in Second Language Acquisition 3: 357–385.
Article Google Scholar
Schmitt, N., and D.M. Stults. 1986. Methodology review: analysis of multi-trait multi-method matrices. Applied Psychological Measurement 10: 1–22.
Article Google Scholar
Schoonen, R., A. Van Gelderen, K. De Glopper, J. Hulstijn, P. Snellings, A. Simis, and M. Stevenson. 2002. Linguistic knowledge, metacognitive knowledge, and retrieval speed in L1, L2 and EFL writing: a structural equation modelling approach. In New directions for research in L2 writing, ed. S. Ransdell, and M.L. Barbier, 101–122. Dordrecht: Kluwer Academic.
Chapter Google Scholar
Scollon, R. 2001. Mediated discourse: the nexus of practice. London: Routledge.
Book Google Scholar
Scollon, R., and S.W. Scollon. 2003. Discourses in place: language in the material world. London: Routledge.
Book Google Scholar
Scollon, R., and W.B.K. Scollon. 2004. Nexus analysis: Discourse and the emerging internet. London: Routledge.
Google Scholar
Scollon, R., and S.W. Scollon. 2009. Multimodality and language: a retrospective and prospective view. In The Routledge handbook of multimodal analysis, ed. C. Jewitt, 170–180. London: Routledge.
Google Scholar
Scriven, M. 1967. The methodology of evaluation. In Perspectives on curriculum evaluation, ed. R.W. Tylor, R.M. Gagne, and M. Scriven, 39–83. Chicago: Rand McNally.
Google Scholar
Searle, J.R. 1969. Speech act: an essay in the philosophy of language. Cambridge: Cambridge University Press.
Book Google Scholar
Shepard, L.A. 1993. Evaluating test validity. In Review of research in education, vol. 19, ed. L. Darling-Hammond, 405–450. Washington DC: American Educational Research Association.
Google Scholar
Shepard, L.A. 1997. The centrality of test use and consequences for test validity. Educational Measurement: Issues and Practice, 16(2), 5–8, 13, 24.
Google Scholar
Shepard, L.A. 2000. The role of assessment in a learning culture. Educational Researcher 29(7): 4–14.
Article Google Scholar
Shohamy, E. 1981. Inter-rater and intra-rater reliability of the oral interview and concurrent validity with cloze procedure. In The construct validation of tests of communicative competence, ed. A.S. Palmer, J.M. Groot, and G.A. Trosper, 94–105. Washington, DC: TESOL.
Google Scholar
Shohamy, E. 1996. Competence and performance in language testing. In Performance and competence in second language acquisition, ed. G. Brown, K. Malmkjaer, and J. William, 138–151. Cambridge: Cambridge University Press.
Google Scholar
Shohamy, E. 2001. The power of tests: a critical perspective of the uses of language tests. London: Longman.
Google Scholar
Shohamy, E., C.M. Gordon, and R. Kraemer. 1992. The effect of raters’ background and training on the reliability of direct writing tests. Modern Language Journal 76: 27–33.
Article Google Scholar
Shute, V.J. 2008. Focus on formative feedback. Review of Educational Research 78(1): 153–189.
Article Google Scholar
Simpson, J. 2003. Report on BAAL/CUP seminar on multimodality and applied linguistics. Reading, UK.
Google Scholar
Sinclair, J.M., and M. Coulthard. 1975. Towards an analysis of discourse. Oxford: Oxford University Press.
Google Scholar
Skehan, P. 1984. Issues in the testing of English for specific purposes. Language Testing 1(2): 202–220.
Article Google Scholar
Skehan, P. 1995. Analysability, accessibility and ability for use. In Principles and practice in applied linguistics, ed. G. Cook, and B. Seidlhofer, 91–106. Oxford: Oxford University Press.
Google Scholar
Skehan, P. 1996. Second language acquisition research and task-based instruction. In Challenge and change in language teaching, ed. J. Willis, and D. Willis, 17–30. Oxford: Heinemann.
Google Scholar
Smith, D. 2000. Rater judgments in the direct assessment of competency-based second language writing ability. In Studies in immigrant English language assessment, vol. 1, ed. G. Brindley, 159–189. Sydney: Macquarie University.
Google Scholar
Sparhawk, C.M. 1978. Contrastive identificational features of Persian gesture. Semiotica 24: 49–86.
Article Google Scholar
Spolsky, B. 1986. A multiple choice for language testers. Language Testing 3(2): 147–158.
Article Google Scholar
Spolsky, B. 1989a. Communicative competence, language proficiency and beyond. Applied Linguistics 10(2): 138–156.
Article Google Scholar
Spolsky, B. 1989b. Conditions for second language learning: introduction to a general theory. Oxford: Oxford University Press.
Google Scholar
Spolsky, B. 1993. Testing and examinations in a national foreign language policy. In National foreign language policies: practice and prospects, ed. K. Sajavaara, S. Takala, D. Lambert, and C. Morfit, 124–153. Jyväskyla: Institute for Education Research, University of Jyväskyla.
Google Scholar
Spolsky, B. 2008. Introduction: language testing at 25: maturity and responsibility? Language Testing 25(3): 297–305.
Article Google Scholar
Stein, P. 2008. Multimodal pedagogies in diverse classrooms: representation, rights and resources. London: Routledge.
Google Scholar
Stern, H.H. 1978. The formal-functional distinction in language pedagogy: a conceptual clarification. Paper presented at the 5th AILA congress, Montreal, Canada.
Google Scholar
Stöckl, H. 2004. In between modes: language and image in printed media. In Perspectives on multimodality, ed. E. Ventola, C. Charles, and M. Kaltenbacher, 9–30. Amsterdam: John Benjamins.
Chapter Google Scholar
Street, B.V. (ed.). 1993. Cross-cultural approaches to literacy. Cambridge: Cambridge University Press.
Google Scholar
Suppe, F. 1977. The structure of scientific theories, 2nd ed. Urbana: University of Illinois Press.
Google Scholar
Swain, M. 1985. Communicative competence: some roles of comprehensible input and comprehensible output in its development. In Input in second language acquisition, ed. S. Gass, and C. Madden, 235–256. New York: Newbury House.
Google Scholar
Tan, S. 2009. A systemic functional framework for the analysis of corporate television advertisements. In The world told and the world shown: multisemiotic issues, ed. E. Ventola, and A.J.M. Guijarro, 157–182. Hampshire: Palgrave Macmillan.
Google Scholar
Tan, S. 2010. Modelling engagement in a web-based advertising campaign. Visual Communication 9(1): 91–115.
Article Google Scholar
Tarone, E.E., and G. Yule. 1989. Focus on the language learner: approaches to identifying and meeting the needs of second language learners. Oxford: Oxford University Press.
Google Scholar
Teasdale, A., and C. Leung. 2000. Teacher assessment and psychometric theory: a case of paradigm crossing? Language Testing 17(2): 163–184.
Article Google Scholar
Thibault, P.J. 2000. The multimodal transcription of a television advertisement. In Multimodality and multimediality in the distance learning age, ed. A. Baldry, 311–385. Campobasso, Italy: Palladino.
Google Scholar
Thorndike, E.L. 1920. A constant error in psychological ratings. Journal of Applied Psychology 4: 469–477.
Google Scholar
Thorndike, R.M. 1997. Measurement and evaluation in psychology and education. Upper Saddle River: Merrill.
Google Scholar
Tomasello, M. 2003. Constructing a language: a usage-based theory of language acquisition. London: Harvard University Press.
Google Scholar
Toulmin, S.E. 2003. The uses of argument. Cambridge: Cambridge University Press.
Google Scholar
Tseng, C., and J. Bateman. 2010. Chain and choice in filmic narrative: an analysis of multimodal narrative construction in The Fountain. In Narrative revisited, ed. C.R. Hoffmann, 213–244. Amsterdam: John Benjamins.
Chapter Google Scholar
Turner, C.E. 1989. The underlying factor structure of L2 cloze test performance in Francophone, University- level students: Causal modelling as an approach to construct validation. Language Testing, 6(2):172–197.
Google Scholar
Turner, C.E., and J.A. Upshur. 2002. Rating scales derived from student samples: effects of the scale maker and the student sample on scale content and student scores. TESOL Quarterly 36(1): 49–70.
Article Google Scholar
Underhill, N. 1987. Testing spoken English. Cambridge: Cambridge University Press.
Google Scholar
Unsworth, L., and E. Chan. 2009. Bridging multimodal literacies and national assessment programs in literacy. Australian Journal of Language and Literacy 32(3): 245–257.
Google Scholar
Upshur, J.A., and C.E. Turner. 1995. Constructing rating scales for second language tests. ELT Journal 49(1): 3–12.
Article Google Scholar
Upshur, J.A., and C.E. Turner. 1999. Systematic effects in the rating of second language speaking ability: test method and learner discourse. Language Testing 16(1): 82–111.
Google Scholar
van Dijk, T.A. 1977. Text and context: exploration in the semantics and pragmatics of discourse. London: Longman.
Google Scholar
van Ek, J.A. 1975. The threshold level in a European unit/credit system for modern language learning by adults. Strasbourg: Council of Europe.
Google Scholar
van Leeuwen, T. 1999. Speech, sound and music. London: Macmillan.
Book Google Scholar
van Leeuwen, T. 2001. Visual racism. In The semiotics of racism, ed. R. Wodak, and M. Reisigl, 333–350. Vienna: Passagen Verlag.
Google Scholar
van Leeuwen, T. 2011. The language of colour: an introduction. London: Routledge.
Google Scholar
van Lier, L. 1989. Reeling, writhing, drawling, stretching, and fainting in coils: oral proficiency interviews as conversation. TESOL Quarterly 23(3): 489–508.
Article Google Scholar
van Moere, A. 2007. Group oral test: how does task affect candidate performance and test score? Unpublished Ph.D. thesis, The University of Lancaster, UK.
Google Scholar
Vaughan, C. 1991. Holistic assessment: what goes on in the rater’s mind? In Assessing second language writing in academic contexts, ed. L. Hamp-Lyons, 111–125. Norwood: Ablex.
Google Scholar
Verhoeven, L. 1997. Sociolinguistics and education. In The handbook of sociolinguistics, ed. F. Coulmas, 389–404. Oxford: Blackwell.
Google Scholar
Wainer, H., and H.I. Braun (eds.). 1988. Test validity. Hilldale: Lawrence Erlbaum Associates.
Google Scholar
Wang, Y. 2009. The design of multimodal listening autonomous learning and its effect. Computer Assisted Foreign Language Education 6: 62–65.
Google Scholar
Wang, L., G. Beckett, and L. Brown. 2006. Controversies of standardised assessment in school accountability reform: a critical synthesis of multidisciplinary research evidence. Applied Measurement in Education 19(4): 305–328.
Article Google Scholar
Webbink, P. 1986. The power of the eyes. New York: Springer.
Google Scholar
Wei, Q. 2009. A study on multimodality and college students’ multiliteracies. Computer Assisted Foreign Language Education 2: 28–32.
Google Scholar
Weigle, S.C. 1994. Effects of training on raters of ESL compositions. Language Testing 11(2): 197–223.
Article Google Scholar
Weigle, S.C. 1999. Investigating rater/prompt interactions in writing assessment: quantitative and qualitative approaches. Assessing Writing 6(2): 145–178.
Article Google Scholar
Weigle, S.C. 2002. Assessing writing. Cambridge: Cambridge University Press.
Book Google Scholar
Weiner, M., et al. 1972. Nonverbal behaviour and nonverbal communication. Psychological Review 79: 185–214.
Article Google Scholar
Weir, C.J. 1990. Communicative language testing. Englewood Cliffs: Prentice Hall Regents.
Google Scholar
Weir, C.J. 2005. Limitations of the Common European Framework of Reference for Languages (CEFR) for developing comparable examinations and tests. Language Testing 22(3): 281–300.
Article Google Scholar
White, E.M. 1985. Teaching and assessing writing. San Francisco: Jossey-Bass Inc.
Google Scholar
White, S. 1989. Backchannels across cultures: a study of Americans and Japanese. Language in Society 18: 59–76.
Article Google Scholar
Widaman, K.F. 1985. Hierarchically tested covariance structure models for multi-trait multi-method data. Applied Psychological Measurement 9: 1–26.
Article Google Scholar
Widdowson, H.G. 1978. Teaching language as communication. Oxford: Oxford University Press.
Google Scholar
Wolfe, E.W. 1997. The relationship between essay reading style and scoring proficiency in a psychometric scoring system. Assessing Writing 4(1): 83–106.
Article Google Scholar
Wolfe, E.W., C. Kao, and M. Ranney. 1998. Cognitive differences in proficient and non-proficient essay scorers. Written Communication 15: 465–492.
Article Google Scholar
Wolfe-Quintero, K., S. Inagaki, and H.-Y. Kim. 1998. Second language development in writing: measures of fluency, accuracy and complexity. Honolulu: University of Hawaii at Manoa.
Google Scholar
Wolfson, N. 1989. Perspectives: sociolinguistics and TESOL. New York: Newbury House.
Google Scholar
Wylie, L. 1977. Beaux gesters: a guide to French body talk. New York: E. P. Dutton.
Google Scholar
Xi, X. 2010. How do we go about investigating test fairness? Language Testing 27(2): 147–170.
Article Google Scholar
Yamashiro, A.D. 2002. Using structural equation modelling for construct validation of an English as a foreign language public speaking rating scale. Unpublished Ph.D. thesis, Temple University, USA.
Google Scholar
Yang, H., and C.J. Weir. 1998. Validation study of the national College English Test. Shanghai: Shanghai Foreign Language Education Press.
Google Scholar
Young, R. 1995. Discontinuous language development and its implications for oral proficiency rating scales. Applied Language Learning 6: 13–26.
Google Scholar
Young, R., and W. He. 1998a. Language proficiency interviews: a discourse approach. In Talking and testing: discourse approaches to the assessment of oral proficiency, ed. R. Young, and W. He, 1–24. Philadelphia: John Benjamins.
Chapter Google Scholar
Young, R., and W. He (eds.). 1998b. Talking and testing: discourse approaches to the assessment of oral proficiency. Philadelphia: John Benjamins.
Google Scholar
Zebrowitz, L.A. 1997. Reading faces: window to the soul?. Boulder: Westview Press.
Google Scholar
Zhang, D. 2009. On a synthetic theoretical framework for multimodal discourse analysis. Foreign Languages in China 1: 24–30.
Google Scholar
Zhang, Z. 2010. A co-relational study of multimodal PPT presentation and students’ learning achievements. Foreign Languages in China 3: 54–58.
Google Scholar
Zhang, D., and L. Wang. 2010. The synergy of different modes in multimodal discourse and their realisation in foreign language teaching. Foreign Language Research 2: 97–102.
Google Scholar
Zhu, Y. 2007. Theory and methodology of multimodal discourse analysis. Foreign Language Research 5: 82–86.
Google Scholar
Zhu, Y. 2008. Studies on multiliteracy ability and reflections on their effects on teaching.
Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of English Language and Culture, Guangdong University of Foreign Studies, Guangzhou, Guangdong, China
Mingwei Pan

Authors

Mingwei Pan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mingwei Pan .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Pan, M. (2016). Literature Review. In: Nonverbal Delivery in Speaking Assessment. Springer, Singapore. https://doi.org/10.1007/978-981-10-0170-3_2

Download citation

DOI: https://doi.org/10.1007/978-981-10-0170-3_2
Published: 28 November 2015
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-0169-7
Online ISBN: 978-981-10-0170-3
eBook Packages: Social SciencesSocial Sciences (R0)

Publish with us

Policies and ethics