Exploring Speech Features for Classifying Emotions along Valence Dimension

Koolagudi, Shashidhar G.; Sreenivasa Rao, K.

doi:10.1007/978-3-642-11164-8_87

Shashidhar G. Koolagudi²¹ &
K. Sreenivasa Rao²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5909))

Included in the following conference series:

International Conference on Pattern Recognition and Machine Intelligence

1545 Accesses
7 Citations

Abstract

Naturalness of human speech is mainly because of the embedded emotions. Today’s speech systems lack the component of emotion processing within them. In this work, classification of emotions from the speech data is attempted. Here we have made an effort to search, emotion specific information from spectral features. Mel frequency cepstral coefficients are used as speech features. Telugu simulated emotion speech corpus (IITKGP-SESC) is used as a data source. The database contains 8 emotions. The experiments are conducted for studying the influence of speaker, gender and language related information on emotion classification. Gaussian mixture models are use to capture the emotion specific information by modeling the distribution. An average emotion detection rate of around 65% and 80% are achieved for gender independent and dependent cases respectively.

Download to read the full chapter text

Chapter PDF

Text-Dependent Versus Text-Independent Speech Emotion Recognition

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

Robust Features for Emotion Recognition from Speech by Using Gaussian Mixture Model Classification

Keywords

References

Lee, C.M., Narayanan, S.S.: Toward detecting emotions in spoken dialogs. IEEE Trans. Speech and Audio Processing 13, 293–303 (2005)
Article Google Scholar
Jin, X., Wang, Z.: An Emotion Space Model for Recognition of Emotions in Spoken Chinese. In: Tao, J., Tan, T., Picard, R.W. (eds.) ACII 2005. LNCS, vol. 3784, pp. 397–402. Springer, Heidelberg (2005)
Chapter Google Scholar
Rao, K.S., Yegnanarayana, B.: Intonation modeling for indian languages. CSL (2008)
Google Scholar
Yildirim, S., Bulut, M., Lee, C.M., Kazemzadeh, A., Busso, C., Deng., Z., Lee, S., Narayanan, S.: An acoustic study of emotions expressed in speech. In: Int’l Conf. on Spoken Language Processing (ICSLP 2004), Jeju island, Korean (October 2004)
Google Scholar
Ververidis, D., Kotropoulos, C., Pitas, I.: Automatic emotional speech classifcation. In: ICASSP 2004, pp. I593–I596. IEEE, Los Alamitos (2004)
Google Scholar
Burkhardt, F., Sendlmeier, W.F.: Verification of acousical correlates of emotional speech using formant-synthesis. In: ITRW on Speech and Emotion, Newcastle, Northern Ireland, UK, September 5-7, pp. 151–156 (2000)
Google Scholar
Oudeyer, P.-Y.: The production and recognition of emotions in speech: features and algorithms. International Journal of Human Computer Studies 59, 157–183 (2003)
Article Google Scholar
Koolagudi, S.G., Maity, S., Kumar, V.A., Chakrabarti, S., Rao, K.S.: IITKGP- SESC: Speech Database for Emotion Analysis. In: Communications in Computer and Information Science, JIIT University, Noida, India, August 17-19. Springer, Heidelberg (2009)
Google Scholar
Tato, R., Santos, R., Pardo, R.K.J.: Emotional space improves emotion recognition. In: 7th International Conference on Spoken Language Processing, Denver, Colorado, USA, September 16-20 (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, West Bengal, India
Shashidhar G. Koolagudi & K. Sreenivasa Rao

Authors

Shashidhar G. Koolagudi
View author publications
You can also search for this author in PubMed Google Scholar
K. Sreenivasa Rao
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Electrical Engineering Department, Indian Institute of Technology Delhi, 110016, New Delhi, India
Santanu Chaudhury
Center for Soft Computing Research, Indian Statistical Institute, 700 108, Kolkata, India
Sushmita Mitra
Center for Soft Computing Research, Indian Statistical Institute,
C. A. Murthy
Department of Electrical Engineering, Indian Institute of Science, 560012, Bangalore, INDIA
P. S. Sastry
Center for Soft Computing Research, Machine Intelligence Unit, Indian Statistical Institute, 203 Barrackpore Trunk Road, 700 108, Kolkata, India
Sankar K. Pal

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Koolagudi, S.G., Sreenivasa Rao, K. (2009). Exploring Speech Features for Classifying Emotions along Valence Dimension. In: Chaudhury, S., Mitra, S., Murthy, C.A., Sastry, P.S., Pal, S.K. (eds) Pattern Recognition and Machine Intelligence. PReMI 2009. Lecture Notes in Computer Science, vol 5909. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11164-8_87

Download citation

DOI: https://doi.org/10.1007/978-3-642-11164-8_87
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11163-1
Online ISBN: 978-3-642-11164-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The International Association for Pattern Recognition (opens in a new tab)

Exploring Speech Features for Classifying Emotions along Valence Dimension

Abstract

Chapter PDF

Similar content being viewed by others

Text-Dependent Versus Text-Independent Speech Emotion Recognition

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

Robust Features for Emotion Recognition from Speech by Using Gaussian Mixture Model Classification

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Exploring Speech Features for Classifying Emotions along Valence Dimension

Abstract

Chapter PDF

Similar content being viewed by others

Text-Dependent Versus Text-Independent Speech Emotion Recognition

Robust Emotion Recognition using Pitch Synchronous and Sub-syllabic Spectral Features

Robust Features for Emotion Recognition from Speech by Using Gaussian Mixture Model Classification

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation