Skip to main content

Toward a Technology of Conversation

  • Chapter
  • First Online:
The Conversational Interface

Abstract

Conversation is a natural and intuitive mode of interaction. As humans, we engage all the time in conversation without having to think about how conversation actually works. In this chapter, we examine the key features of conversational interaction that will inform us as we develop conversational interfaces for a range of smart devices. In particular, we describe how utterances in a conversation can be viewed as actions that are performed in the pursuit of a goal; how conversation is structured; how participants in conversation collaborate to make conversation work; what the language of conversation looks like; and the implications for developers of applications that engage in conversational interaction with humans.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 199.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    http://www.cs.rochester.edu/research/trains/annotation.

References

  • Allen JF (1995) Natural language understanding, 2nd edn. Benjamin Cummings Publishing Company Inc., Redwood

    MATH  Google Scholar 

  • Allen JF, Core M (1997) Draft of DAMSL: dialog act markup in several layers. The Multiparty Discourse Group. University of Rochester, Rochester, USA. http://www.cs.rochester.edu/research/cisd/resources/damsl/RevisedManual/. Accessed 20 Jan 2016

  • Allen JF, Ferguson G, Stent A (2001) An architecture for more realistic conversational systems. In: Proceedings of intelligent user interfaces 2001 (IUI-01), Santa Fe, NM, 14–17 Jan 2001. doi:10.1145/359784.359822

  • Alexandersson J, Buschbeck-Wolf B, Fujinami T, Maier E, Reithinger N, Schmitz B, Siegel M (1997) Dialog acts in VERBMOBIL-2. Verbmobil report 204, May 1997, DFKI GmbH, Saarbrücken Germany

    Google Scholar 

  • Allwood J (1976) Linguistic communication as action and cooperation. Gothenburg monographs in linguistics 2. University of Göteborg, Department of Linguistics

    Google Scholar 

  • Antaki C (2002) An introductory tutorial in conversation analysis. http://www-staff.lboro.ac.uk/~ssca1/sitemenu.htm. Accessed on 26 Jan 2016

  • Austin JL (1962) How to do things with words. Oxford University Press, Oxford

    Google Scholar 

  • Bohus D (2007) Error awareness and recovery in conversational spoken language interfaces. Ph.D. dissertation, Carnegie Mellon University, Pittsburgh, PA

    Google Scholar 

  • Bunt HC (1979) Conversational principles in question-answer dialogs. In: Krallmann D (ed) Zur Theorie der Frage. Narr Verlag, Essen, pp 119–141

    Google Scholar 

  • Bunt HC (1995) DIT – dynamic interpretation and dialog theory. In: Taylor MM, Neel F, Bouwhuis DG (eds) Proceedings from the second Venaco workshop on multimodal dialog. Benjamins, Amsterdam, pp 139–166

    Google Scholar 

  • Bunt HC (2009) The DIT++ taxonomy for functional dialog markup. In: Heylen D, Pelachaud C, Catizone R, Traum DR (eds) Proceedings of the AMAAS 2009 workshop towards a standard markup language for embodied dialog acts. Budapest, May 2009, pp 13–24

    Google Scholar 

  • Bunt HC (2011) Multifunctionality in dialog. Comp Speech Lang 25:222–245. doi:10.1016/j.csl.2010.04.006

    Google Scholar 

  • Bunt HC, Black W (eds) (2000) Abduction, belief and context in dialog: studies in computational pragmatics. John Benjamins Publishing Company, Amsterdam. doi:10.1075/nlp.1

    Google Scholar 

  • Bunt HC, Alexandersson J, Choe J-W, Fang AC, HasidaK, PetukhovaV, Popescu-Belis A, Traum DR (2012a) ISO 24617-2: A semantically-based standard for dialog annotation. In: Proceedings of the 8th international conference on language resources and evaluation (LREC 2012), Istanbul, pp 430–437. http://www.lrec-conf.org/proceedings/lrec2012/pdf/180_Paper.pdf. Accessed 2 Mar 2016

  • Bunt HC, Kipp M, Petukhova V (2012b) Using DiAML and ANVIL for multimodal dialog annotation. In: Proceedings of the 8th international conference on language resources and evaluation (LREC 2012), Istanbul, pp 1301–1308. http://www.lrec-conf.org/proceedings/lrec2012/pdf/1107_Paper.pdf. Accessed 2 Mar 2016

  • Carletta J, Isard A, Isard S, Kowtko J, Doherty-Sneddon G, Anderson A (1997) The reliability of a dialog structure coding scheme. Comput Linguist 23:13–31. http://dl.acm.org/citation.cfm?id=972686. Accessed 20 Jan 2016

  • Clark HH (1996) Using language. Cambridge University Press, Cambridge. doi:10.1017/cbo9780511620539

  • Clark HH, Brennan SE (1991) Grounding in communication. In: Resnick LB, Levine JM, Teasley SD (eds) Perspectives on socially shared cognition. American Psychological Association, Washington, pp 127–149. doi:10.1037/10096-006

  • Clark HH, Schaefer EF (1989) Contributing to discourse. Cogn Sci 13:259–294. doi:10.1207/s15516709cog1302_7

    Article  Google Scholar 

  • Cooper R, Larsson S, Matheson C, Poesio M, Traum DR (1999) Coding instructional dialog for information states Trindi project deliverable D1.1. http://www.ling.gu.se/projekt/trindi//publications.html. Accessed 20 Jan 2016

  • Coupland N, Giles H, Wiemann J (eds) (1991) Miscommunication and problematic talk. Sage Publications, London

    Google Scholar 

  • Eggins S, Slade D (2005) Analysing casual conversation. Equinox Publishing Ltd., Sheffield

    Google Scholar 

  • Fernández R (2014) Dialog. In: Mitkov R (ed) The Oxford handbook of computational linguistics, 2nd edn. Oxford University Press, Oxford. doi:10.1093/oxfordhb/9780199573691.013.25

  • Frampton M (2009) Reinforcement learning in spoken dialog systems: optimising repair strategies. VDM Verlag, Saarbrücken

    Google Scholar 

  • Frampton M, Lemon O (2005) Reinforcement learning of dialog strategies using the user’s last dialog act. In: Proceedings of 4th IJCAI workshop on knowledge and reasoning in practical dialog systems, Edinburgh. https://pureapps2.hw.ac.uk/portal/en/publications/reinforcement-learning-of-dialog-strategies-using-the-users-last-dialog-act(193e9575–2081-4338-b37a-d7a0c47e9dc9).html. Accessed 20 Jan 2016

  • Geis ML (2006) Speech acts and conversational interaction. Cambridge University Press, Cambridge

    Google Scholar 

  • Griol D, Hurtado L, Segarra E, Sanchis E (2008) A statistical approach to spoken dialog systems design and evaluation. Speech Commun 50:666–682. doi:10.1016/j.specom.2008.04.001

    Article  Google Scholar 

  • Griol D, Callejas Z, López-Cózar R, Riccardi G (2014) A domain-independent statistical methodology for dialog management in spoken dialog systems. Comp Speech Lang 28:743–768. doi:10.1016/j.csl.2013.09.002

    Article  Google Scholar 

  • Grosz BJ, Sidner CL (1986) Attention, intentions, and the structure of discourse. Comput Linguist 12(3):175–204. http://dl.acm.org/citation.cfm?id=12458. Accessed 20 Jan 2016

  • Gumperz J (1978) The conversational analysis of interethnic communication. In: Ross EL (ed) Interethnic communication. University of Georgia Press, Athens, pp 13–31

    Google Scholar 

  • Hastie H, Poesio M, Isard S (2002) Automatically predicting dialog structure using prosodic features. Speech Commun 36(1–2):63–79. doi:10.1016/S0167-6393(01)00026-7

    Article  MATH  Google Scholar 

  • Hayashi M, Raymond G, Sidnell J (eds) (2013) Conversational repair and human understanding. Cambridge University Press, Cambridge

    Google Scholar 

  • Heeman P, Allen JF (1994) Detecting and correcting speech repairs. In: Proceedings of the 32nd annual meeting of the Association of Computational Linguistics, Las Cruces, pp 295–302. doi:10.3115/981732.981773

  • Hirschberg J (2002) Communication and prosody: functional aspects of prosody. Speech Commun 36(1–2):31–43. doi:10.1016/S0167-6393(01)00024-3

    Google Scholar 

  • Hutchby I, Wooffitt R (2008) Conversation analysis. Polity Press, Oxford

    Google Scholar 

  • Jokinen K, McTear M (2010) Spoken dialog systems. Synthesis lectures on human language technologies. Morgan and Claypool Publishers, San Rafael. doi:10.2200/S00204ED1V01Y200910HLT005

    Google Scholar 

  • Jurafsky D, Martin JH (2009) Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd edn. Prentice Hall, Upper Saddle River

    Google Scholar 

  • Jurafsky D, Shriberg E, Biasca D (1997) Switchboard SWBD-DAMSL shallow-discourse-function annotation coders manual, Draft 13. University of Colorado, Boulder, CO. Institute of Cognitive Science Technical Report 97-02. https://web.stanford.edu/~jurafsky/ws97/manual.august1.html. Accessed 20 Jan 2016

  • Kipp M (2012) Multimedia annotation, querying and analysis in ANVIL. In: Maybury M (ed) Multimedia information extraction. IEEE Computer Society Press. doi:10.1002/9781118219546.ch21

    Google Scholar 

  • Kowtko J, Isard S, Doherty GM (1993) Conversational games within dialog. Research paper HCRC/RP-31, Human Communication Research Centre, University of Edinburgh

    Google Scholar 

  • Larsson S, Traum DR (2000) Information state and dialog management in the TRINDI dialog move engine toolkit. Nat Lang Eng 6(3–4):323–340. doi:10.1017/S1351324900002539

    Article  Google Scholar 

  • Levinson SC (1983) Pragmatics. Cambridge University Press, Cambridge

    Google Scholar 

  • Lewis JR (2011) Practical speech user interface design. CRC Press, Boca Raton. doi:10.1201/b10461

    Google Scholar 

  • Matheson C, Poesio M, Traum DR (2001) Modelling grounding and discourse obligations using update rules. In: Proceedings of the first annual meeting of the North American chapter of the ACL, Seattle, April 2001

    Google Scholar 

  • McTear, M (2008) Handling miscommunication: why bother? In: Dybkjaer L, Minker W (eds) Recent trends in discourse and dialog. Springer, New York, pp 101–122. doi:10.1007/978-1-4020-6821-8_5

  • Narayanan S, Georgiou PG (2013) Behavioral signal processing: deriving human behavioral informatics from speech and language. Proc IEEE 101(5):1203–1233. doi:10.1109/JPROC.2012.2236291

    Article  Google Scholar 

  • Nass C, Brave S (2004) Wired for speech: how voice activates and advances the human-computer relationship. MIT Press, Cambridge

    Google Scholar 

  • Nöth E, Batliner A, Warnke V, Haas J, Boros M, Buckow J, Huber R, Gallwitz F, Nutt M, Niemann H (2002) On the use of prosody in automatic dialog understanding. Speech Commun 36(1–2):45–62. doi:10.1016/S0167-6393(01)00025-5

    Article  MATH  Google Scholar 

  • Pentland A (2007) Social signal processing. Signal Process Mag 24(4):108–111. doi: 10.1109/MSP.2007.4286569

    Google Scholar 

  • Rudnicky AJ, Wu X (1999) An agenda-based dialog management architecture for spoken language systems. In: Proceedings of IEEE automatic speech recognition and understanding workshop (ASRU99), Chichester, UK, pp 3–7. http://www.cs.cmu.edu/~xw/asru99-agenda.pdf. Accessed 20 Jan 2016

  • Sacks H (1984) On doing ‘being ordinary’. In: Atkinson JM, Heritage JC (eds) Structures of social action: studies in conversation analysis. Cambridge University Press, Cambridge. doi:10.1017/CBO9780511665868.024

  • Sacks H, Schegloff EA, Jefferson G (1974) A simplest systematics for the organization of turn-taking for conversation. Language 50(4):696–735. doi:10.1353/lan.1974.0010

    Article  Google Scholar 

  • Schegloff EA (1968) Sequencing in conversational openings. Am Anthropol 70:1075–1095. doi:10.1525/aa.1968.70.6.02a00030

    Article  Google Scholar 

  • Schegloff EA (1982) Discourse as an interactional achievement: some uses of “uh huh” and other things that come between sentences. In: Tannen D (ed) Analysing discourse: text and talk. Georgetown University Roundtable on Languages and Linguistics 1981, Georgetown University Press, Washington, DC, pp 71–93

    Google Scholar 

  • Schegloff EA, Sacks H (1973) Opening up closings. Semiotica 8(4):289–327. doi:10.1515/semi.1973.8.4.289

    Article  Google Scholar 

  • Schegloff EA, Jefferson G, Sacks H (1977) The preference for self-correction in the organisation of repair in conversation. Language 53:361–382. doi:10.2307/413107

    Article  Google Scholar 

  • Schröder M, Bevacqua E, Cowie R, Eyben F, Gunes H, Heylen D, ter Maat M, McKeown G, Pammi S, Pantic M, Pelachaud C, Schuller B, de Sevin E, Valstar M, Wöllmer M (2012) Building autonomous sensitive artificial listeners. IEEE Trans Affect Comput 3(2):165–183. doi:10.1109/T-AFFC.2011.34

    Article  Google Scholar 

  • Schuller B, Batliner A (2013) Computational paralinguistics: emotion, affect and personality in speech and language processing. Wiley, Chichester. doi:10.1002/9781118706664

    Google Scholar 

  • Searle JR (1969) Speech acts. Cambridge University Press, Cambridge. doi:10.1017/CBO9781139173438

  • Searle JR (ed) (2013) Speech act theory and pragmatics. Springer, New York. doi:10.1007/978-94-009-8964-1

    Google Scholar 

  • Sidnell J (2010) Conversation analysis: an introduction. Wiley-Blackwell, Chichester

    Google Scholar 

  • Sidnell J, Stivers, T (eds) (2014) The handbook of conversation analysis. Wiley-Blackwell, Chichester. doi:10.1002/9781118325001

    Google Scholar 

  • Sinclair JM, Coulthard M (1975) Towards an analysis of discourse. Oxford University Press, Oxford

    Google Scholar 

  • Skantze G (2007) Error handling in spoken dialog systems—managing uncertainty, grounding and miscommunication. Ph.D. dissertation, KTH, Stockholm, Sweden

    Google Scholar 

  • Skantze G, Hjalmarsson A (2013) Towards incremental speech generation in conversational systems. Comp Speech Lang 27(1):243–262. doi:10.1016/j.csl.2012.05.004

    Article  Google Scholar 

  • Stent A (2002) A conversation acts model for generating spoken dialog contributions. Comp Speech Lang 16:313–352. doi:10.1016/s0885-2308(02)00009-8

    Article  Google Scholar 

  • Tannen D (2001) You just don’t understand: women and men in conversation. Ballentine Books, New York

    Google Scholar 

  • Traum DR (1994) A computational theory of grounding in natural language conversation. Ph.D. dissertation, Department of Computer Science, University of Rochester, New York

    Google Scholar 

  • Traum DR (2000) 20 questions for dialog act taxonomies. J Seman 17(1):7–30. doi:10.1093/jos/17.1.7

    Article  Google Scholar 

  • Traum DR, Hinkelmann EA (1992) Conversation acts in task-oriented spoken dialog. Comput Intell 8(3):575–599. doi:10.1111/j.1467-8640.1992.tb00380.x

    Article  Google Scholar 

  • Vinciarelli A, Pantic M, Bourlard H (2009) Social signal processing: survey of an emerging domain. Image Vis Compu 27(12):1743–1759. doi:10.1016/j.imavis.2008.11.007

    Article  Google Scholar 

  • Webber B, Egg M, Kordoni V (2012) Discourse structure and language technology. Nat Lang Eng 18(4):437–490. doi:10.1017/S1351324911000337

    Article  Google Scholar 

  • Wittgenstein L (1958) Philosophical investigations. Blackwell, Oxford

    MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Michael McTear .

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this chapter

Cite this chapter

McTear, M., Callejas, Z., Griol, D. (2016). Toward a Technology of Conversation. In: The Conversational Interface. Springer, Cham. https://doi.org/10.1007/978-3-319-32967-3_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-32967-3_3

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32965-9

  • Online ISBN: 978-3-319-32967-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics