Patterns of Synchronization of Non-verbal Cues and Speech in ECAs: Towards a More “Natural” Conversational Agent

Rossini, Nicla

doi:10.1007/978-3-642-18184-9_9

Nicla Rossini²¹

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6456))

1182 Accesses
20 Altmetric

Abstract

This paper presents an analysis of the verbal and non-verbal cues of Conversational Agents, with a special focus on REA and GRETA, in order to allow further research aimed at correcting some traits of their performance still considered unnatural by their final users. Despite the striking performance of new generation ECA, some important features make these conversational agents unreliable to the users, who usually prefer interacting with a classical computer for information retrieval. The users’ preference can be due to several factors, such as the quality of speech synthesis, or the inevitable unnaturalness of the graphics animating the avatar. Apart from the unavoidable traits that can render ECAs unnatural to the ultimate users, instances of poor synchronization between verbal and non-verbal behaviour may contribute to unfavourable results. An instance of synchronization patterns between non-verbal cues and speech is here analysed and re-applied to the basic architecture of an ECA in order to improve the ECA’s verbal and non-verbal synchronization. A proposal for future inquiry aimed at creating alternative model for the ultimate Mp4 output is also proposed, for further development in this field.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hartmann, B., Mancini, M., Pelachaud, C.: Implementing Expressive Gesture Synthesis for Embodied Conversational Agents. In: Gibet, S., Courty, N., Kamp, J.-F. (eds.) GW 2005. LNCS (LNAI), vol. 3881, pp. 188–199. Springer, Heidelberg (2006)
Chapter Google Scholar
Cassell, J., Stocky, T., Bickmore, T., Gao, Y., Nakano, Y., Ryokai, K., Tversky, D., Vaucelle, C., Vilhjálmsson, H.: MACK: Media lab Autonomous Conversational Kiosk. In: Proceedings of IMAGINA 2002, Monte Carlo, January 12-15 (2002), http://www.media.mit.edu/gnl/pubs/imagina02.pdf
Cassell, J.: Trading spaces: Gesture Morphology and Semantics in Humans and Virtual Humans. In: Second ISGS Conference “Interacting bodies”. École normale supérieure Lettres et Sciences humaines Lyon - France, June 15-18 (2005)
Google Scholar
Cassell, J., Bickmore, T., Billinghurst, M., Campbell, L., Chang, K., Vilhjálmsson, Yan, H.: Embodiment in Conversational Interfaces: Rea. In: Proceedings of the CHI 1999 Conference, Pittsburgh, PA, pp. 520–527 (1999)
Google Scholar
Mancini, M., Bresin, R., Pelachaud, C.: An expressive virtual agent head driven by music performance. IEEE Transactions on Audio, Speech and Language Processing 15(6), 1833–1841 (2007)
Article Google Scholar
Cassell, J., Nakano, Y., Bickmore, T., Sidner, C., Rich, C.: Annotating and Generating Posture from Discourse Structure in Embodied Conversational Agents. In: Workshop on Representing, Annotating, and Evaluating Non-Verbal and Verbal Communicative Acts to Achieve Contextual Embodied Agents, Autonomous Agents 2001 Conference, Montreal, Quebec, May 29 (2001), http://www.ccs.neu.edu/home/bickmore/publications/agents01.pdf
Poggi, I., Pelachaud, C.: Performative facial expressions in animated ‘faces’. Speech Communication 26, 5–21 (1998)
Article Google Scholar
Niewiadomski, R., Ochs, M., Pelachaud, C.: Expressions of Empathy in ECAs. In: Prendinger, H., Lester, J.C., Ishizuka, M. (eds.) IVA 2008. LNCS (LNAI), vol. 5208, pp. 37–44. Springer, Heidelberg (2008)
Chapter Google Scholar
Rossini, N.: The analysis of gesture: Establishing a set of parameters. In: Camurri, A., Volpe, G. (eds.) GW 2003. LNCS (LNAI), vol. 2915, pp. 124–131. Springer, Heidelberg (2004)
Chapter Google Scholar
McNeill, D.: Hand and Mind. What Gestures Reveal about Thought. University of Chicago Press, Chicago (1992)
Google Scholar
Eibl-Eibesfeldt, I.: Similarities and differences between cultures in expressive movements. In: Hinde, A. (ed.) Non-verbal Communication, pp. 297–312. Cambridge University Press, Cambridge (1972)
Google Scholar
Kita, S., van Gijn, I., van der Hulst, H.: The non-linguistic status of the Symmetry Condition in signed languages: Evidence from a comparison from signs and speech-accompanying representational gestures (in progress)
Google Scholar
Rossini, N.: Sociolinguistics in gesture. How about the Mano a Borsa? In: Intercultural Communication Studies, XIII, 3, pp. 144–154; Proceedings of the 9th International Conference on Cross-Cultural Communication (2004)
Google Scholar
Rossini, N.: Gesture and its cognitive origin: Why do we gesture? Experiments on hearing and deaf people. Università di Pavia Ph.D. thesis (2004)
Google Scholar
Thies, A.: First the hand, then the word: On gestural displacement in non-native English speech. Universität Bielefeld Ph.D. Thesis (2003)
Google Scholar
Gibbon, D.: Modelling gesture as speech: a linguistic approach. In: Proceedings of GESPIN 2009, Conference on Gestures and Speech in Interaction, Poznan (September 2009) (to appear)
Google Scholar
Rossini, N.: Il gesto. Gestualità e tratti non verbali in interazioni diadiche. Bologna, Pitagora (2009)
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Studi Umanistici, Università del Piemonte Orientale, Li.Co.T.T.- Palazzo Tartara, Via G. Ferraris 109, I-13100, Vercelli, Italy
Nicla Rossini

Authors

Nicla Rossini
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Institute for Advanced Scientific Studies, Second University of Naples, and IIASS, Via Pellegrino 19, 84019, Vietri sul Mare (SA), Italy
Anna Esposito
Istituto Nazionale di Geofisica e Vulcanologia, Osservatorio Vesuviano, Via Diocleziano 328, 80124, Napoli, Italy
Antonietta M. Esposito
Dipartemento di Ingegneria dell’ Informazione, Seconda Università di Napoli, Via Roma 29, 81031, Aversa (CE), Italy
Raffaele Martone
Department of Humanities and Social Sciences, Anatolia College/ACT, Kennedy Street, 55510, Pylaia, Greece
Vincent C. Müller
Departmnet of Physics "E.R. Caoamoeööp", University of Salerno and IIASS, International Institute for Advanced Scientific Studies, 84081, Baronissi (SA), Italy
Gaetano Scarpetta

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rossini, N. (2011). Patterns of Synchronization of Non-verbal Cues and Speech in ECAs: Towards a More “Natural” Conversational Agent. In: Esposito, A., Esposito, A.M., Martone, R., Müller, V.C., Scarpetta, G. (eds) Toward Autonomous, Adaptive, and Context-Aware Multimodal Interfaces. Theoretical and Practical Issues. Lecture Notes in Computer Science, vol 6456. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-18184-9_9

Download citation

DOI: https://doi.org/10.1007/978-3-642-18184-9_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-18183-2
Online ISBN: 978-3-642-18184-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics