Multimodal Laughter Detection in Natural Discourses

Scherer, Stefan; Schwenker, Friedhelm; Campbell, Nick; Palm, Günther

doi:10.1007/978-3-642-10403-9_12

Part of the book series: Cognitive Systems Monographs (COSMOS, volume 6)

Abstract

This work addresses the detection of laughter in natural multiparty discourse. Features from two modalities are extracted from unobtrusive sources, namely a room microphone and a 360-degree camera. Detection is performed with Echo State Networks (ESNs), a relatively novel approach. One possible application is online laughter detection in human-robot interaction: because laughter is an important communicative signal, detecting it allows a robot to react appropriately and promptly to human communication.
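
The abstract does not give the network configuration or the features used, so the following is only a generic sketch of how an Echo State Network can label frames of a sequence, not the authors' implementation. It assumes per-frame feature vectors (here two synthetic values standing in for fused audio and video features), a fixed random reservoir, and a linear readout fitted by ridge regression. All names, sizes, and the toy data are hypothetical.

```python
import math
import random


def make_reservoir(n_in, n_res, scale=0.5, seed=0):
    """Draw fixed random input and recurrent weights (these are never trained)."""
    rng = random.Random(seed)
    w_in = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_res)]
    w = []
    for _ in range(n_res):
        row = [rng.uniform(-1, 1) for _ in range(n_res)]
        norm = sum(abs(v) for v in row)
        # Each row's absolute sum becomes `scale` < 1, which makes the state
        # update a contraction (a simple sufficient condition for fading memory).
        w.append([scale * v / norm for v in row])
    return w_in, w


def run_reservoir(w_in, w, inputs):
    """Return the reservoir state after each input frame."""
    x = [0.0] * len(w)
    states = []
    for u in inputs:
        x = [math.tanh(sum(a * b for a, b in zip(w_in[i], u))
                       + sum(a * b for a, b in zip(w[i], x)))
             for i in range(len(w))]
        states.append(x)
    return states


def solve(a, b):
    """Solve the linear system a z = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [bi] for row, bi in zip(a, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(c + 1, n):
            f = m[r][c] / m[c][c]
            for k in range(c, n + 1):
                m[r][k] -= f * m[c][k]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        z[r] = (m[r][n] - sum(m[r][k] * z[k] for k in range(r + 1, n))) / m[r][r]
    return z


def train_readout(states, targets, ridge=1e-2):
    """Fit linear readout weights (plus a bias term) by ridge regression."""
    feats = [s + [1.0] for s in states]
    d = len(feats[0])
    gram = [[sum(f[i] * f[j] for f in feats) + (ridge if i == j else 0.0)
             for j in range(d)] for i in range(d)]
    rhs = [sum(f[i] * t for f, t in zip(feats, targets)) for i in range(d)]
    return solve(gram, rhs)


def predict(states, w_out):
    """Apply the linear readout to every reservoir state."""
    return [sum(a * b for a, b in zip(w_out, s + [1.0])) for s in states]


# Synthetic toy data (an assumption, not the chapter's corpus): alternating
# 20-frame blocks of "no laughter" (0) and "laughter" (1) with noisy features.
rng = random.Random(1)
labels = [1.0 if (t // 20) % 2 else 0.0 for t in range(200)]
frames = [[lab + rng.gauss(0, 0.3), rng.gauss(0, 0.3)] for lab in labels]

w_in, w = make_reservoir(n_in=2, n_res=20)
states = run_reservoir(w_in, w, frames)
w_out = train_readout(states, labels)
scores = predict(states, w_out)
detected = [s > 0.5 for s in scores]  # per-frame laughter decision
```

Only the readout weights are fitted, which is what makes this family of models cheap to train. Because each frame is processed with a fixed amount of work, the same state update could in principle run online, frame by frame, as the abstract's human-robot interaction scenario requires.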

Author information

Authors and Affiliations

Institute of Neural Information Processing, Ulm University
Stefan Scherer, Friedhelm Schwenker & Günther Palm
Center for Language and Communication Studies, Trinity College Dublin
Nick Campbell

Editor information

Editors and Affiliations

Technische Fakultät, AG Neuroinformatik, Universität Bielefeld, Universitätsstr. 25, 33615, Bielefeld
Helge Ritter
Technische Fakultät, AG Angewandte Informatik, Universität Bielefeld, Universitätsstr. 25, 33615, Bielefeld
Gerhard Sagerer
Technologiefabrik, Uni Karlsruhe, Haid-und-Neu-Strasse 7, D-76131, Karlsruhe
Rüdiger Dillmann
Institute of Automatic Control Engineering (LSR), Technische Universität München, Arcisstraße 21, 80290, München, Germany
Martin Buss

About this chapter

Cite this chapter

Scherer, S., Schwenker, F., Campbell, N., Palm, G. (2009). Multimodal Laughter Detection in Natural Discourses. In: Ritter, H., Sagerer, G., Dillmann, R., Buss, M. (eds) Human Centered Robot Systems. Cognitive Systems Monographs, vol 6. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10403-9_12

DOI: https://doi.org/10.1007/978-3-642-10403-9_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10402-2
Online ISBN: 978-3-642-10403-9
eBook Packages: Engineering, Engineering (R0)
