Abstract
Conversation is a natural and intuitive mode of interaction. As humans, we engage in conversation all the time without having to think about how it actually works. In this chapter, we examine the key features of conversational interaction that will inform the development of conversational interfaces for a range of smart devices. In particular, we describe how utterances in a conversation can be viewed as actions performed in pursuit of a goal; how conversation is structured; how participants collaborate to make conversation work; what the language of conversation looks like; and what this implies for developers of applications that engage in conversational interaction with humans.
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
McTear, M., Callejas, Z., Griol, D. (2016). Toward a Technology of Conversation. In: The Conversational Interface. Springer, Cham. https://doi.org/10.1007/978-3-319-32967-3_3
DOI: https://doi.org/10.1007/978-3-319-32967-3_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32965-9
Online ISBN: 978-3-319-32967-3
eBook Packages: Engineering, Engineering (R0)