Abstract
Although an increasing amount of research has been carried out into human-machine interaction in the last century, even today we are not able to fully understand the dynamic changes in human interaction. Only when we achieve this, will we be able to go beyond a one-to-one mapping between text and speech and be able to add social information to speech technologies. Social information is expressed to a high degree through prosodic cues and movement of the body and the face. The aim of this paper is to use those cues to make one aspect of social information more tangible; namely participants’ degree of involvement in a conversation. Our results for voice span and intensity, and our preliminary results on the movement of the body and face suggest that these cues are reliable cues for the detection of distinct levels of participants involvement in conversation. This will allow for the development of a statistical model which is able to classify these stages of involvement. Our data indicate that involvement may be a scalar phenomenon.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Antil, J.H.: Conceptualization and Operationalization of Involvement. Advances in Consumer Research 11(1), 203–209 (1984)
Wrede, B., Shriberg, E.: Spotting Hot Spots in Meetings: Human Judgements and Prosodic Cues. In: Proceedings of Eurospeech 2003, Geneva, pp. 2805–2808 (2003)
Dillon, R.: Lecture Notes in Computer Science: A Possible Model for Predicting Listener’s Emotional Engagement. Springer, Heidelberg (2006)
Selting, M.: Emphatic speech style: with special focus on the prosodic signalling of heightened emotive involvement in conversation. Journal of pragmatics 22(3-4), 375–408 (1994)
Gustafson, J., Neiberg, D.: Prosodic cues to engagement in non- lexical response tokens in Swedish. In: DiSS-LPSS Joint Workshop 2010, Tokyo, Japan (2010)
Yu, C., Aoki, P.M., Woodruff, A.: Detecting user engagement in everyday conversations. In: 8th International Conference on Spoken Language Processing (ICSLP 2004), Jeju Island, Korea, pp. 1329–1332 (2004)
Gatica-Perez, D.: Modeling Interest in Face-to-Face Conversations from Multimodal Nonverbal Behavior. In: Thiran, J.-P., Bourlard, H., Marques, F. (eds.) Multimodal Signal Processing, pp. 309–323. Academic Press, San Diego (2009)
Duncan, S., Baldenebro, T., Lawandow, A., Levow, G.-A.: Multi-modal Analysis of Interactional Rapport in Three Language Cultural Groups. In: Workshop on Modeling Human Communication Dynamics, Vancouver, B.C., Canada, pp. 42–45 (2010)
Crystal, D., Davy, D.: Investigating English Style. Longman Group. Ltd., London (1969)
Oertel, C., Cummins, F., Campbell, N., Edlund, J., Wagner, P.: D64: a corpus of richly recorded conversational interaction. In: Proceedings of LREC 2010; Workshop on Multimodal Corpora: Advances in Capturing, Coding and Analyzing Multimodality, Valetta, pp. 27–30 (2010)
Oertel, C.: Identification of Cues for the Automatic Detection of Hotspots. Bielefeld University, Bielefeld (2010) (unpublished)
Boersma, P., Weenink, D.: Praat: doing phonetics by computer
De Looze, C., Hirst, D.J.: Integrating changes of register into automatic intonation analysis. In: Proceedings of the Speech Prosody 2010 Conferene, Chicago, 4 pages (2010)
Tamburini, F., Wagner, P.: On automatic prominence detection for german. In: Proceedings of Interspeech 2007, Antwerp, pp. 1809–1802 (2007)
Scherer, S., Campbell, N.: Multimodal laughter detection in natural discourses. In: Proceedings of the 3rd International Workshop on Human-Centered Robotic Systems (HCRS 2009), pp. 111–121 (2009)
Viola, P., Jones, M.J.: Robust real-time face detection. International Journal of Computer Vision 57(2), 137–154 (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oertel, C., De Looze, C., Scherer, S., Windmann, A., Wagner, P., Campbell, N. (2011). Towards the Automatic Detection of Involvement in Conversation. In: Esposito, A., Vinciarelli, A., Vicsi, K., Pelachaud, C., Nijholt, A. (eds) Analysis of Verbal and Nonverbal Communication and Enactment. The Processing Issues. Lecture Notes in Computer Science, vol 6800. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25775-9_16
Download citation
DOI: https://doi.org/10.1007/978-3-642-25775-9_16
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25774-2
Online ISBN: 978-3-642-25775-9
eBook Packages: Computer ScienceComputer Science (R0)