Skip to main content

MFCC Global Features Selection in Improving Speech Emotion Recognition Rate

  • Conference paper
  • First Online:
Advances in Machine Learning and Signal Processing

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 387))

Abstract

Feature selection is one of the important aspects that contribute most to the emotion recognition system performance apart from the database and the classification technique used. Based on the previous finding, Mel Frequency Cepstral Coefficients (MFCC) are said to be good for emotion recognition purpose. This paper discusses the use of MFCC features to recognize human emotion on Berlin database in the German language. Global features are extracted from MFCC and tested with three classification methods; Naive Bayes, Artificial Neural Network (ANN) and Support Vector Machine (SVM). We investigate the capabilities of MFCC global features using 13, 26 and 39-dimensional cepstral features in recognizing emotions from speech. The result from the experiment will be further discussed in this paper.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Panda B, Padhi D, Dash K, Mohanty PS (2012) Use of SVM classifier & MFCC in speech emotion recognition system. Int J Adv Res Comput Sci Softw Eng 2(3):225–230

    Google Scholar 

  2. El Ayadi M, Kamel M, Karray F (2011) Survey on speech emotion recognition: features, classification schemes, and databases. Pattern Recogn 44(3):572–587

    Google Scholar 

  3. Kishore KVK, Satish PK (2013) Emotion recognition in speech using MFCC and wavelet features. In: 2013 IEEE 3rd international advance computing conference (IACC), pp. 842–847

    Google Scholar 

  4. Zhou Y, Sun Y, Zhang J, Yan Y (2009) Speech emotion recognition using both spectral and prosodic features. In: International conference on information engineering and computer science, pp 1–4, Dec

    Google Scholar 

  5. Schuller B, Rigoll G, Lang M (2003) Hidden Markov model-based speech emotion recognition. In: ICME ‘03 Proceedings of the 2003 International Conference on Multimedia and Expo—Vol 2, pp 401–404 (2003)

    Google Scholar 

  6. Origlia A, Galatà V, Ludusan B (2010) Automatic classification of emotions via global and local prosodic features on a multilingual emotional database. In: Proceedings of the 5th international conference on speech prosody

    Google Scholar 

  7. Huang Y, Zhang G, Li X (2011) Improved emotion recognition with novel global utterance-level features. Appl Math Inf Sci 5(2):147–153

    Google Scholar 

  8. Ravi Kumar KM, Ganesan S (2011) Comparison of multidimensional MFCC feature vectors for objective assessment of stuttered disfluencies. Int J Adv Netw Appl 860:854–860

    Google Scholar 

  9. Loughran R, Walker J, O’Neill M, O’Farrell M (2008) The use of mel-frequency cepstral coefficients in musical instrument identification. In: Music conference, Belfast

    Google Scholar 

  10. Pradier MF (2011) Emotion recognition from speech signals and perception of music. Thesis, University of Stuttgart

    Google Scholar 

  11. Zhongzhe X (2008) Recognition of emotions in audio signals. Thesis

    Google Scholar 

  12. Kumar K, Kim C, Stern RM (2011) Delta-spectral cepstral coefficients for robust speech recognition. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 4784–4787

    Google Scholar 

  13. Hall M, Frank E, Holmes G, Pfahringer B, Reutemann P, Witten IH (2009) The WEKA data mining software: an update. SIGKDD Explor 11(1):10–18

    Google Scholar 

  14. Burkhardt F, Paeschke A, Rolfes M (2005) A database of German emotional speech. In: Interspeech, pp 3–6

    Google Scholar 

  15. Neiberg D, Laukka P, Ananthakrishnan G (2010) Classification of affective speech using normalized time-frequency cepstra. In: Fifth international conference on speech prosody

    Google Scholar 

  16. Qi-Rong M, Zhan Y-Z (2010) A novel hierarchical speech emotion recognition method based on improved DDAGSVM. Comput Sci Inf Syst 7(1):211–222

    Article  Google Scholar 

  17. Shen P, Changjun Z, Chen X (2011) Automatic speech emotion recognition using support vector machine. In: International conference on electronic and mechanical engineering and information technology, pp 621–625, Aug

    Google Scholar 

  18. Safdarkhani MK, Mojaver SP, Atieghechi S, Riahi MS (2012) Emotion Recognition of Speech Using ANN and GMM. Aust J Basic Appl Sci 6(9):45–57

    Google Scholar 

Download references

Acknowledgments

Special thanks to Ministry of Education (MOE) and Research Management Centre (RMC), Universiti Teknologi Malaysia providing financial support of this research in FRGS Vot number R.J130000.7828.4F253.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Noor Aina Zaidan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Zaidan, N.A., Salam, M.S. (2016). MFCC Global Features Selection in Improving Speech Emotion Recognition Rate. In: Soh, P., Woo, W., Sulaiman, H., Othman, M., Saat, M. (eds) Advances in Machine Learning and Signal Processing. Lecture Notes in Electrical Engineering, vol 387. Springer, Cham. https://doi.org/10.1007/978-3-319-32213-1_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-32213-1_13

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-32212-4

  • Online ISBN: 978-3-319-32213-1

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics