Abstract
Feature selection is one of the important aspects that contribute most to the emotion recognition system performance apart from the database and the classification technique used. Based on the previous finding, Mel Frequency Cepstral Coefficients (MFCC) are said to be good for emotion recognition purpose. This paper discusses the use of MFCC features to recognize human emotion on Berlin database in the German language. Global features are extracted from MFCC and tested with three classification methods; Naive Bayes, Artificial Neural Network (ANN) and Support Vector Machine (SVM). We investigate the capabilities of MFCC global features using 13, 26 and 39-dimensional cepstral features in recognizing emotions from speech. The result from the experiment will be further discussed in this paper.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Panda B, Padhi D, Dash K, Mohanty PS (2012) Use of SVM classifier & MFCC in speech emotion recognition system. Int J Adv Res Comput Sci Softw Eng 2(3):225–230
El Ayadi M, Kamel M, Karray F (2011) Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn 44(3):572–587
Kishore KVK, Satish PK (2013) Emotion recognition in speech using MFCC and wavelet features. In: 2013 IEEE 3rd international advance computing conference (IACC), pp. 842–847
Zhou Y, Sun Y, Zhang J, Yan Y (2009) Speech emotion recognition using both spectral and prosodic features. In: International conference on information engineering and computer science, pp 1–4, Dec
Schuller B, Rigoll G, Lang M (2003) Hidden Markov model-based speech emotion recognition. In: ICME ‘03 Proceedings of the 2003 International Conference on Multimedia and Expo—Vol 2, pp 401–404 (2003)
Origlia A, Galatà V, Ludusan B (2010) Automatic classification of emotions via global and local prosodic features on a multilingual emotional database. In: Proceedings of the 5th international conference on speech prosody
Huang Y, Zhang G, Li X (2011) Improved emotion recognition with novel global utterance-level features. Appl Math Inf Sci 5(2):147–153
Ravi Kumar KM, Ganesan S (2011) Comparison of multidimensional MFCC feature vectors for objective assessment of stuttered disfluencies. Int J Adv Netw Appl 860:854–860
Loughran R, Walker J, O’Neill M, O’Farrell M (2008) The use of mel-frequency cepstral coefficients in musical instrument identification. In: Music conference, Belfast
Pradier MF (2011) Emotion recognition from speech signals and perception of music. Thesis, University of Stuttgart
Zhongzhe X (2008) Recognition of emotions in audio signals. Thesis
Kumar K, Kim C, Stern RM (2011) Delta-spectral cepstral coefficients for robust speech recognition. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4784–4787
Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. SIGKDD Explor 11(1):10–18
Burkhardt F, Paeschke A, Rolfes M (2005) A database of German emotional speech. In: Interspeech, pp 3–6
Neiberg D, Laukka P, Ananthakrishnan G (2010) Classification of affective speech using normalized time-frequency cepstra. In: Fifth international conference on speech prosody
Qi-Rong M, Zhan Y-Z (2010) A novel hierarchical speech emotion recognition method based on improved DDAGSVM. Comput Sci Inf Syst 7(1):211–222
Shen P, Changjun Z, Chen X (2011) Automatic speech emotion recognition using support vector machine. In: International conference on electronic and mechanical engineering and information technology, pp 621–625, Aug
Safdarkhani MK, Mojaver SP, Atieghechi S, Riahi MS (2012) Emotion Recognition of Speech Using ANN and GMM. Aust J Basic Appl Sci 6(9):45–57
Acknowledgments
Special thanks to Ministry of Education (MOE) and Research Management Centre (RMC), Universiti Teknologi Malaysia providing financial support of this research in FRGS Vot number R.J130000.7828.4F253.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Zaidan, N.A., Salam, M.S. (2016). MFCC Global Features Selection in Improving Speech Emotion Recognition Rate. In: Soh, P., Woo, W., Sulaiman, H., Othman, M., Saat, M. (eds) Advances in Machine Learning and Signal Processing. Lecture Notes in Electrical Engineering, vol 387. Springer, Cham. https://doi.org/10.1007/978-3-319-32213-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-32213-1_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32212-4
Online ISBN: 978-3-319-32213-1
eBook Packages: EngineeringEngineering (R0)