MFCC Global Features Selection in Improving Speech Emotion Recognition Rate

Zaidan, Noor Aina; Salam, Md Sah

doi:10.1007/978-3-319-32213-1_13

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 387)

Abstract

Feature selection is among the factors that most affect the performance of an emotion recognition system, alongside the database and the classification technique used. Previous findings indicate that Mel Frequency Cepstral Coefficients (MFCC) are well suited to emotion recognition. This paper discusses the use of MFCC features to recognize human emotion on the Berlin database of German speech. Global features are extracted from the MFCCs and tested with three classification methods: Naive Bayes, Artificial Neural Network (ANN), and Support Vector Machine (SVM). We investigate the capability of MFCC global features, using 13-, 26-, and 39-dimensional cepstral features, to recognize emotions from speech. The experimental results are discussed in the paper.
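
As a rough illustration of what "global features" over 13-, 26-, and 39-dimensional cepstral vectors can look like, the sketch below collapses a frame-by-frame MFCC matrix into one utterance-level vector. It rests on several assumptions not stated in the abstract: that the 26- and 39-dimensional vectors are formed by appending delta and delta-delta coefficients, that deltas can be approximated by a simple central difference, and that the global statistics are mean, standard deviation, minimum, and maximum. The function names (`deltas`, `stack`, `global_features`) and the toy input are hypothetical, not the authors' implementation.

```python
from statistics import fmean, pstdev

def deltas(frames):
    """Central-difference approximation of delta coefficients.

    Assumption: a simple stand-in for the regression-based deltas
    commonly used in speech front ends. Edges reuse the boundary frame.
    """
    n = len(frames)
    out = []
    for t in range(n):
        prev = frames[max(t - 1, 0)]
        nxt = frames[min(t + 1, n - 1)]
        out.append([(b - a) / 2.0 for a, b in zip(prev, nxt)])
    return out

def stack(*streams):
    """Concatenate the per-frame vectors of several feature streams."""
    n_frames = len(streams[0])
    return [[x for s in streams for x in s[t]] for t in range(n_frames)]

def global_features(frames):
    """Collapse a (frames x dims) matrix into one utterance-level vector.

    Assumed statistics: mean, standard deviation, min, max per dimension.
    """
    columns = list(zip(*frames))
    feats = []
    for stat in (fmean, pstdev, min, max):
        feats.extend(stat(col) for col in columns)
    return feats

# Toy input: 5 frames of 13 made-up "MFCC" values (not real audio).
mfcc = [[float(t * (k + 1)) for k in range(13)] for t in range(5)]
d1 = deltas(mfcc)   # first-order deltas
d2 = deltas(d1)     # second-order (delta-delta)

for name, frames in (("13-dim", mfcc),
                     ("26-dim", stack(mfcc, d1)),
                     ("39-dim", stack(mfcc, d1, d2))):
    vec = global_features(frames)
    print(name, "frames ->", len(vec), "global features")
```

Each resulting fixed-length vector could then be passed to a classifier such as Naive Bayes, an ANN, or an SVM; the choice of statistics determines the final dimensionality.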

Acknowledgments

Special thanks to the Ministry of Education (MOE) and the Research Management Centre (RMC), Universiti Teknologi Malaysia, for providing financial support for this research under FRGS Vot number R.J130000.7828.4F253.

Author information

Authors and Affiliations

Faculty of Computing, Universiti Teknologi Malaysia, 81310, Skudai, Johor, Malaysia
Noor Aina Zaidan & Md Sah Salam

Corresponding author

Correspondence to Noor Aina Zaidan.

Editor information

Editors and Affiliations

Universiti Teknikal Malaysia Melaka, Durian Tunggal, Melaka, Malaysia
Ping Jack Soh
Singapore Campus, #05-01 SIT Building, Newcastle University, Singapore, Singapore
Wai Lok Woo
Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Hamzah Asyrani Sulaiman
Universiti Teknikal Malaysia Melaka, Melaka, Malaysia
Mohd Azlishah Othman
Universiti Teknikal Malaysia Melaka, Durian Tunggal, Melaka, Malaysia
Mohd Shakir Saat

About this paper

Cite this paper

Zaidan, N.A., Salam, M.S. (2016). MFCC Global Features Selection in Improving Speech Emotion Recognition Rate. In: Soh, P., Woo, W., Sulaiman, H., Othman, M., Saat, M. (eds) Advances in Machine Learning and Signal Processing. Lecture Notes in Electrical Engineering, vol 387. Springer, Cham. https://doi.org/10.1007/978-3-319-32213-1_13

DOI: https://doi.org/10.1007/978-3-319-32213-1_13
Published: 19 June 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-32212-4
Online ISBN: 978-3-319-32213-1
eBook Packages: Engineering, Engineering (R0)
