A Hybrid Neural Emotion Recogniser for Human-Robotic Agent Interaction
This paper presents a hybrid neural approach to emotion recognition from speech, which combines feature selection using principal component analysis (PCA) with unsupervised neural clustering through self-organising map (SOM). Given the importance that is associated with emotions in humans, it is unlikely that robots will be accepted as anything more that machines if they do not express and recognise emotions. In this paper, we describe the performance of an unsupervised approach to emotion recognition that achieves similar performance to current supervised intelligent approaches. Performance, however, reduces when the system is tested using samples from a male volunteer not in the training set using a low cost microphone. Through the use of an unsupervised neural approach, it is possible to go beyond the basic binary classification of emotions to consider the similarity between emotions and whether speech can express multiple emotions at the same time.
KeywordsEmotion recognition social robot interaction unsupervised neural learning
Unable to display preview. Download preview PDF.
- 2.Attias, H.: Learning in High Dimensions: Modular Mixture Models. Microsoft Research, USA (2001)Google Scholar
- 4.Burkhardt, F., Paeschke, A., Rolfe, M., Sendlmeier, W., Weis, B.: A Database of German Emotional Speech. In: Interspeech, Lisbon (2005)Google Scholar
- 7.Eyben, F., Woellmer, M., Schuller, B.: openSMILE - The Munich Versatile and Fast Open-Source Audio Feature Extractor. ACM Multimedia, 1459–1462 (2010)Google Scholar
- 8.Hall, M.: Correlation-based Feature Selection for Machine Learning (1999)Google Scholar
- 9.Haykin, S.: Neural Networks: A Comprehensive Foundation, Toronto, Canada. Macmillian College Publishing Company (1994)Google Scholar
- 10.Holmes, J., Holmes, W.: Speech Synthesis and Recognition. Taylor and Francis, London (2001)Google Scholar
- 11.Huang, Y., Zhang, G., Xu, X.: Speech Emotion Recognition Research Based on the Stacked Generalization Ensemble Neural Network for Robot Pet. In: Pattern Recognition, CCPR, pp. 1–5 (2009)Google Scholar
- 14.Pan, Y., Shen, P., Shen, L.: Speech Emotion Recognition Using Support Vector Machine. International Journal of Smart Home 6(2), 101–107 (2012)Google Scholar
- 15.Shami, M., Verhelst, W.: An evaluation of the robustness of existing supervised machine learning approaches to the classification of emotions in speech. Speech Communication 49(3) (2007)Google Scholar
- 16.Slavova, V., Verhelst, W., Sahli, H.: A Cognitive Science Reasoning in Recognition of Emotions in Audio-Visual Speech. International Journal Information Technologies and Knowledge 2, 324–334 (2008)Google Scholar
- 18.ten Bosch, L., Van Hamme, H., Boves, L., Moore, R.K.: A computational model of language acquisition: the emergence of words. Fundamenta Informaticae 90, 229–249 (2009)Google Scholar
- 19.Traunmüller, H., Eriksson, A.: The Frequency Range of the Voice Fundamental in the Speech of Male and Female Adults. Department of Linguistics, University of Stockholm, Stockholm (1994)Google Scholar