Assessing an Application of Spontaneous Stressed Speech - Emotions Portal

Palacios-Alonso, Daniel; Lázaro-Carrascosa, Carlos; López-Arribas, Agustín; Meléndez-Morales, Guillermo; Gómez-Rodellar, Andrés; Loro-Álavez, Andrés; Nieto-Lluis, Victor; Rodellar-Biarge, Victoria; Tsanas, Athanasios; Gómez-Vilda, Pedro

doi:10.1007/978-3-030-19591-5_16

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11486))

Included in the following conference series:

International Work-Conference on the Interplay Between Natural and Artificial Computation

1253 Accesses

Abstract

Detecting and identifying emotions expressed in speech signals is a very complex task that generally requires processing a large sample size to extract intricate details and match the diversity of human expression in speech. There is not an emotional dataset commonly accepted as a standard test bench to evaluate the performance of the supervised machine learning algorithms when presented with extracted speech characteristics. This work proposes a generic platform to capture and validate emotional speech. The aim of the platform is collaborative-crowdsourcing and it can be used for any language (currently, it is available in four languages such as Spanish, English, German and French). As an example, a module for elicitation of stress in speech through a set of online interviews and other module for labeling recorded speech have been developed. This study is envisaged as the beginning of an effort to establish a large, cost-free standard speech corpus to assess emotions across multiple languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ley Orgánica 3/2018, de 5 de diciembre, de Protección de Datos Personales y garantía de los derechos digitales - Agencia Estatal Boletín Oficial del Estado. https://www.boe.es/buscar/act.php?id=BOE-A-2018-16673. Accessed 7 Jan 2019
Portal Codec - GFK Group. https://www.portalcodec.com/. Accessed 7 Jan 2019
Arciuli, J., Villar, G., Mallard, D.: Lies, lies and more lies. In: Proceedings of the 31st Annual Conference of the Cognitive Science Society (CogSci 2009), pp. 2329–2334 (2009)
Google Scholar
Burkhardt, F., Paeschke, A., Rolfes, M., Sendlmeier, W.F., Weiss, B.: A database of German emotional speech. In: Interspeech, vol. 5, pp. 1517–1520 (2005)
Google Scholar
Engberg, I.S., Hansen, A.V.: Documentation of the Danish emotional speech database DES. Internal AAU report, Center for Person Kommunikation, Denmark, p. 22 (1996)
Google Scholar
Hansen, J.H., Bou-Ghazale, S.E., Sarikaya, R., Pellom, B.: Getting started with SUSAS: a speech under simulated and actual stress database. In: Eurospeech, vol. 97, pp. 1743–1746 (1997)
Google Scholar
Her - Official Webpage (2013). http://www.herthemovie.com/. Accessed 4 May 2015
Hofbauer, K., Petrik, S., Hering, H.: The ATCOSIM corpus of non-prompted clean air traffic control speech. In: LREC (2008)
Google Scholar
Moore, E., Clements, M.A., Peifer, J.W., Weisser, L.: Critical analysis of the impact of glottal features in the classification of clinical depression in speech. IEEE Trans. Biomed. Eng. 55(1), 96–107 (2008)
Article Google Scholar
Muñoz-Mulas, C., et al.: KPCA vs. PCA study for an age classification of speakers. In: Travieso-González, C.M., Alonso-Hernández, J.B. (eds.) NOLISP 2011. LNCS (LNAI), vol. 7015, pp. 190–198. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-25020-0_25
Chapter Google Scholar
Ramakrishnan, S.: Recognition of emotion from speech: a review. In: Speech Enhancement, Modeling and Recognition-Algorithms and Applications, p. 121 (2012)
Google Scholar
Robot and Frank - IMDB Webpage (2012). http://www.imdb.com/title/tt1990314/. Accessed 4 May 2015
Rodellar, V., Palacios, D., Gomez, P., Bartolome, E.: A methodology for monitoring emotional stress in phonation. In: 2014 5th IEEE Conference on Cognitive Infocommunications (CogInfoCom), pp. 231–236. IEEE (2014)
Google Scholar
Rodellar-Biarge, V., Palacios-Alonso, D., Nieto-Lluis, V., Gómez-Vilda, P.: Towards the search of detection in speech-relevant features for stress. Expert Syst. 32, 701–718 (2015)
Article Google Scholar
Ververidis, D., Kotropoulos, C.: A review of emotional speech databases. In: Proceedings of the Panhellenic Conference on Informatics (PCI), pp. 560–574 (2003)
Google Scholar

Download references

Acknowledgments

This work is being funded by grants TEC2016-77791-C4-4-R (MINECO, Spain) and CENIE _ TECA – PARK_55_02 INTERREG V – A Spain – Portugal (POCTEP).

Author information

Authors and Affiliations

Escuela Técnica Superior de Ingeniería Informática - Universidad Rey Juan Carlos, Campus de Móstoles, Tulipán, s/n, 28933, Móstoles, Madrid, Spain
Daniel Palacios-Alonso, Carlos Lázaro-Carrascosa, Agustín López-Arribas & Guillermo Meléndez-Morales
Neuromorphic Speech Processing Lab, Center for Biomedical Technology, Universidad Politécnica de Madrid, Campus de Montegancedo, 28223, Pozuelo de Alarcón, Madrid, Spain
Daniel Palacios-Alonso, Carlos Lázaro-Carrascosa, Andrés Gómez-Rodellar, Victor Nieto-Lluis, Victoria Rodellar-Biarge & Pedro Gómez-Vilda
Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, UK
Athanasios Tsanas
Hermosilla 60 Legal Counselors, Hermosilla 60, 28001, Madrid, Spain
Andrés Loro-Álavez

Authors

Daniel Palacios-Alonso
View author publications
You can also search for this author in PubMed Google Scholar
Carlos Lázaro-Carrascosa
View author publications
You can also search for this author in PubMed Google Scholar
Agustín López-Arribas
View author publications
You can also search for this author in PubMed Google Scholar
Guillermo Meléndez-Morales
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Gómez-Rodellar
View author publications
You can also search for this author in PubMed Google Scholar
Andrés Loro-Álavez
View author publications
You can also search for this author in PubMed Google Scholar
Victor Nieto-Lluis
View author publications
You can also search for this author in PubMed Google Scholar
Victoria Rodellar-Biarge
View author publications
You can also search for this author in PubMed Google Scholar
Athanasios Tsanas
View author publications
You can also search for this author in PubMed Google Scholar
Pedro Gómez-Vilda
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Daniel Palacios-Alonso .

Editor information

Editors and Affiliations

Universidad Politécnica de Cartagena, Cartagena, Spain
José Manuel Ferrández Vicente
Universidad Nacional de Educación a Distancia, Madrid, Spain
José Ramón Álvarez-Sánchez
Universidad Nacional de Educación a Distancia, Madrid, Madrid, Spain
Félix de la Paz López
Universidad Politécnica de Cartagena, Cartagena, Spain
Javier Toledo Moreo
The Ohio State University, Columbus, OH, USA
Hojjat Adeli

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Palacios-Alonso, D. et al. (2019). Assessing an Application of Spontaneous Stressed Speech - Emotions Portal. In: Ferrández Vicente, J., Álvarez-Sánchez, J., de la Paz López, F., Toledo Moreo, J., Adeli, H. (eds) Understanding the Brain Function and Emotions. IWINAC 2019. Lecture Notes in Computer Science(), vol 11486. Springer, Cham. https://doi.org/10.1007/978-3-030-19591-5_16

Download citation

DOI: https://doi.org/10.1007/978-3-030-19591-5_16
Published: 10 May 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-19590-8
Online ISBN: 978-3-030-19591-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics