Abstract
This paper presents a proposal for the identification of multimodal signals for recognizing four human emotions in the context of human-robot interaction, specifically: happiness, anger, surprise, and neutrality. We propose a multiclass classifier built on two unimodal classifiers: one processing input from a video signal and another using audio. For detecting human emotions from video data, we propose a multiclass image classifier based on a convolutional neural network that achieves \(86.4\%\) generalization accuracy on individual frames and \(100\%\) when used to detect emotions in a video stream. For emotion detection from audio data, we propose a multiclass classifier built from several one-class classifiers, one per emotion, achieving a generalization accuracy of \(69.7\%\). The complete system shows a generalization error of \(0\%\) and is tested with several real users in a sales-robot application.
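The architecture sketched in the abstract combines a frame-level CNN video classifier with per-emotion one-class audio classifiers into one multiclass decision. A minimal Python sketch of how such a late fusion could look is below; the frame-averaging rule, the softmax normalization of the one-class scores, and the `w_video` weight are illustrative assumptions, not the paper's actual method.

```python
import numpy as np

# The four target emotions from the paper.
EMOTIONS = ["happiness", "anger", "surprise", "neutrality"]

def video_decision(frame_probs):
    """Aggregate per-frame CNN softmax outputs into one stream label.

    frame_probs: (n_frames, 4) array-like of class probabilities, one
    row per frame. Averaging over frames before the argmax is one common
    way to turn frame-level predictions into a stream-level decision.
    """
    return EMOTIONS[int(np.asarray(frame_probs).mean(axis=0).argmax())]

def audio_decision(one_class_scores):
    """Turn per-emotion one-class scores into a single label.

    one_class_scores: dict mapping each emotion to the score of its
    one-class classifier (higher = more typical of that emotion).
    Hypothetical rule: pick the emotion whose classifier fires hardest.
    """
    return max(one_class_scores, key=one_class_scores.get)

def fuse(frame_probs, one_class_scores, w_video=0.7):
    """Late fusion of the two modalities (illustrative weights only).

    Softmax-normalizes the audio scores so both modalities live on a
    probability scale, then takes a weighted sum; w_video > 0.5 reflects
    the higher accuracy reported for the video classifier.
    """
    v = np.asarray(frame_probs).mean(axis=0)
    a = np.array([one_class_scores[e] for e in EMOTIONS], dtype=float)
    a = np.exp(a - a.max())
    a /= a.sum()
    return EMOTIONS[int((w_video * v + (1 - w_video) * a).argmax())]
```

With this rule, a modality disagreement is resolved in favor of whichever weighted evidence is stronger, which is why the video weight matters when the two unimodal decisions differ.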
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
Cite this paper
Pérez, A.K., Quintero, C.A., Rodríguez, S., Rojas, E., Peña, O., De La Rosa, F. (2018). Identification of Multimodal Signals for Emotion Recognition in the Context of Human-Robot Interaction. In: Brito-Loeza, C., Espinosa-Romero, A. (eds) Intelligent Computing Systems. ISICS 2018. Communications in Computer and Information Science, vol 820. Springer, Cham. https://doi.org/10.1007/978-3-319-76261-6_6
Print ISBN: 978-3-319-76260-9
Online ISBN: 978-3-319-76261-6