Abstract
Multimodal systems where the user interacts with one or more autonomous agents in a tridimensional environment must be able to interpret multimodal input from a user and react to it in a coherent way. The treatment of natural language dialogues in computer systems can be divided into three components: dialogue management, discourse interpretation and response generation. Dialogue management consists in implementing strategies that drive the dialogue, such as who can take the initiative and who controls the dialogue (the user, the system or both of them) and how the system must react to error situations. The choice of strategy depends, inter alia, on the application of the system and on the degree of robustness of the available speech recognition software.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Ahrenberg, L., Dahlbäck, N. and Jönsson, A. (1990) Discourse representation and discourse management for a natural language dialogue system. In Proceedings of the Second Nordic Conference on Text Comprehension in Man and Machine,Täby, Stockholm.
Allen, J. F., Schubert, L. K., Ferguson, G., Heeman, P., Hwang, C., Kato, T., Light, M., Martin, N., Miller, B., Poesio, M. and Traum, D. R. (1995) The TRAINS project: a case study in building a conversa-tional planning agent. Journal of Experimental and Theoretical Artificial Intelligence 7: 7–48.
Arens, Y., Hovy, E. and Vossers, M. (1993) On the knowledge underlying multimedia presentations. In Intelligent Multimedia Interfaces (ed. Maybury, M. T. ). Cambridge: AAAI Press, pp. 280–306.
Asher, N. (1993) Reference to Abstract Objects in Discourse. Dordrecht: Kluwer Academic.
Asher, N. and Lascarides, A. (1994) Intentions and information in discourse. In Proceedings of the 32 nd Annual Meeting of the Association for Computational Linguistics, Las Cruces, USA, pp. 34–41.
Asher, N. and Lascarides, A. (1998) Questions in dialogues Linguistics and Philosophy 3 (3): 237–309.
Asher, N. and Lascarides, A. (1999) Cognitive states, discourse structure and the content of dialogue. In Preproceedings of Amstelogue ‘89. Workshop on the Semantics and Pragmatics of Dialogue, Amsterdam University.
Barwise, J. and Perry, J. (1983) Situations and Attitudes. Cambridge, MA: MIT Press.
Beskow, J., Ellenius, K. and McGlashan, S. (1997) Olga - A dialogue system with an animated talking agent. In Proceedings of EUROSPEECH,Rhodes, Greece.
Bunt, H. (1995) Dialogue control functions and interaction design. In Dialogue in Instruction (eds. R. J. Benn, M. Baker and M. Reiner ). Heidelberg: Springer-Verlag, pp. 197–214.
Carlson, R., Hunnicutt, S. and Gustafsson, J. (1995) Dialog management in the Waxholm system. In Proceedings of Spoken Dialogue Systems,Vigse.
Cassell, J. and Vilhjâlmsson, H. (1999) Fully embodied conversational avatars: making communicative behaviours autonomous. Autonomous Agents and Multi-Agent Systems 2 (1): 45–64.
Clark, H. H. and Schaefer, E. F. (1989) Contributing to discourse. Cognitive Science 3: 259–294.
Cohen, P. R. and Levesque, H. J. (1990) Rational Interaction as the Basis for Communication. In Inten-tions in Communication (eds. P. R. Cohen, J. Morgan and M. E. Pollack ). Cambridge, MA: MIT Press, pp. 221–255.
Fikes, R. and Nilsson, N. (1971) STRIPS: a new approach to the application of theorem proving to problem solving. Artificial Intelligence 2: 189–208.
Glass, J., Polifroni, J. and Seneff, S. (1995) Multilingual spoken-language understanding in the MIT Voyager system. Speech Communication 17: 1–18.
Grosz, B. J. and Sidner, C. L. (1986) Attention, intentions, and the structure of discourse. Computational Linguistics 12 (3): 175–284.
Grosz, B., Joshi, A. K. and Weinstein, S. (1986) Towards a computational theory of discourse interpretation, Unpublished MS.
Halliday, M. and Hasan, R. (1976) Cohesion in English. London: Longman.
Hobbs, J. R. (1983) Why is discourse coherent?. In Coherence In Natural-Language Texts (ed. F. Neubauer ). Hamburg: Helmut Buske Verlag, pp. 29–70.
Hobbs, J., Stickel, M., Appelt, D. and Martin, P. (1993) Interpretation as Abduction. Artificial Intelligence 3 (1–2): 69–142.
Johnston, M. (1998) Multimodal unification-based grammars. In AAAI Workshop on Representations for Multimodal Human-Computer Interaction, Madison, WI. Menlo Park, CA: AAAI Press, p. 31.
Jönsson, A. (1997) A model for habitable and efficient dialogue management. Natural Language Engi-neering 2, 3: 103–122.
Kameyama, M. (1998) Intrasentential centering: a case study. In Centering Theory in Discourse (eds. M. Walker, A. Joshi and E. Prince ). Oxford: Oxford University Press, pp. 89–112.
Kamp, H. (1981) A theory of truth and semantic representation. In Formal Methods in the Study of Language (eds. J. A. G. Groenendijk, T. M. V. Janssen and M. B. J. Stokhof), Vol. 1. Foris, pp. 277322.
Kamp, H. and Reyle, U. (1993) From Discourse to Logic: Introduction to Modeltheoretic Semantics of Natural Language, Formal Logic and Discourse Representation Theory. Boston: Kluwer Academic.
Lau, R., Flammia, G., Pao, C. and Zue, V. (1997) WebGALAXY: beyond point and click–a conversational interface to a browser. In Proceedings of the Sixth International World Wide Web Conference (eds. M. Gemeresereth and A. Patterson ), Santa Clara, CA, pp. 119–128.
Litman, D. J. and Allen, J. F. (1990) Discourse processing and commonsense plans. In Intentions in Communication (eds. P. R. Cohen, J. Morgan and M. E. Pollack ). Cambridge, MA: MIT Press, pp. 365–388.
Mann, W. and Thompson, S. A. (1987) Rhetorical structure theory: description and construction of text structures. In Natural Language Generation (ed. G. Kempen ). Boston, MA: Martinus Nijhoff, pp. 85–95.
Martin, J.-C. (1997) Towards “intelligent” cooperation between modalities. The example of a system enabling multimodal interaction with a map. In Proceedings of the IJCAI’97 Workshop on “Intelligent Multimodal Systems”. Nagoya, Japan.
McGlashan, S. (1996) Towards multimodal dialogue management. In Proceedings of 11th Twente Workshop on Language in Texhnology,Enschede, The Netherlands.
McGlashan, S., Bilange, E., Fraser, N., Gilbert, N., Heisterkamp, P. and Youd, N. (1992) Dialogue management for telephone information systems. In Proceedings of the 3rd Conference on Applied Natural Language Processing, Trento, Italy, pp. 245–246.
Moore, J. D. and Pollack, M E. (1992) A problem for RST: the need for multi-level discourse analysis. Computational Linguistics 18 (4): 537–544.
Newell, A. F., Arnott, J. L., Carter, K. and Cruickshank, G. (1990) Listening typewriter simulation studies. International Journal of Man-Machine Studies 3: 1–19.
Oviatt, S., DeAngeli, A. and Kuhn, K. (1997) Integration and synchronization of input modes during multimodal human-computer interaction. In Proceedings of Conference on Human Factors in Computing Systems. New York: ACM Press.
Polanyi, L. (1988) A formal model of the structure of discourse. Journal of Pragmatics 2 (5 6): 601–638.
Searle, J. R. (1975) A taxonomy of illocutionary acts. In Language, Mind, and Knowledge (ed. K. Gundersond ). Minneapolis, MN: University of Minnesota Press, pp. 344–369.
Sinclair, J., Hanks, P., Fox, G., Moon, R. and Stock, P. (eds.) (1987) Collins Cobuild English Language Dictionary, London: HarperCollins.
Traum, D. R. and Hinkelman, E. A. (1992) Conversation acts in task-oriented spoken dialogue. Computational Intelligence 3: 575–599.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag London
About this chapter
Cite this chapter
Navarretta, C. (2001). Exploiting Recent Research on Dialogue to Model Verbal Communication in Staging. In: Qvortrup, L. (eds) Virtual Interaction: Interaction in Virtual Inhabited 3D Worlds. Springer, London. https://doi.org/10.1007/978-1-4471-3698-9_13
Download citation
DOI: https://doi.org/10.1007/978-1-4471-3698-9_13
Publisher Name: Springer, London
Print ISBN: 978-1-84996-863-8
Online ISBN: 978-1-4471-3698-9
eBook Packages: Springer Book Archive