Abstract
This chapter illustrates how the knowledge of linguistics (the scientific study of language) and psycholinguistics (the study of how speech helps to shed light on the human mind and behavior) can help to optimize the design of interactive voice response (IVR) systems for usability and performance. In particular, we examine some of the ways in which linguistics and psycholinguistics can influence the design of conversational (natural language) IVR systems. The central question tackled is what are some of the salient discoveries about formal methodology or the structure of language that can facilitate optimal human-computer interaction? In this regard, we build on the relevance of conversational theory to the fundamental design issue of when to bail out of the conversation due to user frustration. We introduce some new dimensions in the interface of linguistics and psycholinguistics with voice interactive systems’ design, regarding: call flows and the relevance of phrase structure diagrams; natural language understanding (grammars); and the relevance of lexical semantics to labeling utterances in natural language categorization techniques. As with most of the published work in this area, we also deal with the relevance of conversational theory to prompting and dialog design by exploring the application of linguistic theories in the areas of ambiguity, synonymy, and polysemy to grammar development, dialog design, and in understanding vagueness in natural language processing. Lastly, we discuss the concept of "structural simplification" as it relates to the ways in which humans may use different speech strategies to interact with machines in contrast to interactions with other humans. Based on sociolinguistic analysis, we propose that the language style used by humans when interacting with speech systems should be categorized as a register.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Austin, J.L. (1962). How to do things with words. Oxford: Clarendon Press.
Baber, C. (1993). Developing interactive speech technology. In C. Baber & J. M. Noyes (Eds.), Interactive speech technology: Human factors issues in the application of speech input/output to computers(pp. 1-18). London: Taylor & Francis.
Biber, D. (1995). Dimensions of register variation: A cross-linguistic comparison. Cambridge: Cambridge University Press.
Blanchard, H. E., & Stewart, O. T. (2004). Conversational re-prompting in natural language dialog. Proceedings of the 48$th$Annual Meeting of the Human Factors and Ergonomics Society(pp. 708-711). Santa Monica, CA: Human Factors and Ergonomics Society.
Boyce, S. J. (1999). Spoken natural dialog systems: User interface issues for the future. In D. Gardner-Bonneau (Ed.), Human factors and voice interactive systems (pp. 37-61). Norwell, MA: Kluwer Academic Publishers.
Chomsky, N. (1965). Aspects of the theory of syntax.Cambridge, MA: MIT Press.
Chomsky, N. (1982). Some concepts and consequencies of the theory of government and binding. Cambrdige, MA: MIT Press.
Clark, H. H., & Clark, E. V. (1977). Psychology and language. New York: Harcourt Brace Jovanovich.
Cohen, M., Giangola, P., & Balogh, J. (2003). Voice user interface design. Boston: Addison-Wesley.
Church, K. W. (1983). Phrase-structure parsing: A method for taking advantage of allophonic constraints, Unpublished Ph.D dissertation, Massachusetts Institute of Technology, Electrical Engineering and Computer Science.
Falzon, P. (1990). Human-computer interaction: Lessons from human-human communication. In P. Falzon (Ed.), Cognitive ergonomics: Understanding, learning and designing human-computer interaction (pp. 51-66). London: Academic Press.
Franzke, M., Marx, A. N., Roberts, T. L., & Engelbeck, G. E., 1993, Is speech recognition usable? An exploration of the usability of a speech-based voice mail interface, SIGCHI Bulletin, 25(3), 49-51.
Glass, J., Flammia, G., Goodine, D., Phillips, M., Polifroni, J., Sakai, S., Seneff, S., and Zue, V. (1995). Multilingual spoken-language understanding in the MIT VOYAGER system. Speech Communications, 17(1-2), 1-18.
Gorin, A., Parker, B., Sachs, P., & Wilpon, J. (1996). How may I help you? Proceedings of Interactive Voice Technology for Telecommunications Applications (pp. 57-60). Piscataway, NJ: IEEE.
Grice, H. P. (1975). Logic and conversation. In P. Colege & J, L, Morgan (Eds.), Syntax and semantics: Speech acts(pp. 41-58). New York: Academic Press.
Grosz, B. (1977). The representation and use of focus in dialogue understanding, Unpublished Ph.D dissertation, University of California, Berkeley.
Halliday, M.A.K., & Hasan, R. (1976). Cohesion in English. London: Longman.
Johnstone, A., Berry, U., Nguyen, T., & Asper, A. (1995). There was a long pause: Influencing turn-taking behavior in human-human and human-computer spoken dialogs. International Journal of Human-Computer Studies, 42(4), 383-411.
Kennedy, A., Wilkes, A., Elder, L., & Murray, W. S. (1988). Dialogue with machines. Cognition, 30(1), 37-72.
Leiser, R. G. (1989). Improving natural language and speech interfaces by use of metalinguistic phenomena. Applied Ergonomics, 20, 168-173.
Linde, C. (1974). Information structures in discourse, Unpublished Ph.D dissertation, Columbia University.
Mitchell, R. W. (2001). Americans’ talk to dogs: Similarities and differences with talk to infants. Research on Language and Social Interaction, 34(2), 183-210.
Newport, E. L. (1977). Motherese: The speech of mothers to young children. In N. Castellan, D. Pisoni, & G. Potts (Eds.), Cognitive theory, vol. 2 (pp. 177-217). Hillsdale, NJ: Erlbaum.
O’Grady, W., & Dobrovolsky, M. (1992). Contemporary linguistic analysis: An introduction. Toronto: Copp Clark Pitman.
Oviatt, S., Darves, C., & Coulston, R. (2004). Toward adaptive conversational interfaces: Modeling speech convergence with animated personas, ACM Transactions on Computer-Human Interaction, 11(3), 300-328.
Quirk, R., & Greenbaum, S. (1973). A concise grammar of contemporary English. New York: Harcourt Brace Jovanovich.
Rabiner, L. R., & Juang, B. H. (1993). Fundamentals of speech recognition. Englewood Cliffs, NJ: Prentice-Hall.
Reichman, R. (1985). Getting computers to talk like you and me. Cambridge, MA: MIT Press.
Richards, M. A., & Underwood, K. (1984). Talking to machines: How are people naturally inclined to speak? Contemporary Ergonomics 1984: Proceedings of the Ergonomics Society’s Conference 2-5 (pp. 62-67). London: Taylor & Francis.
Sacks, H., Schegloff, E., & Jefferson, G. (1974). A simplest systematics for the organization of turntaking for conversation, Language, 50(4), 696-735.
Searle, J. R. (1976). The classification of illocutionary acts, Language in Society, 5, 1-24.
Snow, C. E., & Ferguson, C. A., (Eds.). (1977). Talking to children: Language input and acquisition. Cambridge: Cambridge University Press.
Stewart, O. T. (2001). The serial verb construction parameter. New York: Garland Press.
Waterworth, J. A., & Talbot, M. (1987). Speech and language-based interaction with machines: Towards the conversational computer. Chichester: Horwood/Halsted.
Zoltan-Ford, E. (1991). How to get people to say and type what computers can understand, International Journal of Man-Machine Studies, 34, 527-547.
Zue, V., Seneff, S., Polifroni, J., Phillips, M., Pao, C., Goodine, D., Goddeau, D., Glass, J., & Brill, E. (1994). PEGASUS: A spoken dialogue interface for on-line air travel planning. Speech Communication, 15, 331-340.
Zue, V., Seneff, S., Glass, J., Hetherington, L., Hurley, E., Meng, H., Pao, C., Polifroni, J., Schloming, R., & Schmid, P. (1997). From interface to content: Translingual access and delivery of on-line information. Proceedings of EUROSPEECH(pp. 100-200).
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer Science + Business Media, LLC
About this chapter
Cite this chapter
Stewart, O.T., Blanchard, H.E. (2008). Linguistics and Psycholinguistics in IVR Design. In: Human Factors and Voice Interactive Systems. Signals and Communication Technology. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-68439-0_3
Download citation
DOI: https://doi.org/10.1007/978-0-387-68439-0_3
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-25482-1
Online ISBN: 978-0-387-68439-0
eBook Packages: EngineeringEngineering (R0)