Analysis of Speech Emotions Using Dynamics of Prosodic Parameters

Palo, Hemanta Kumar; Mohanty, Mihir N.

doi:10.1007/978-981-15-1451-7_36

Hemanta Kumar Palo¹⁸ &
Mihir N. Mohanty¹⁸

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1040))

641 Accesses
6 Citations

Abstract

In this paper, an attempt is made to explore the dynamics of speech prosody to characterize and classify emotional states in a speech signal. The local or fine variations describing the prosodic dynamics are combined with the static prosodic parameters for a possible enhancement in the emotional speech recognition (ESR) accuracy. The efficient vector quantization (VQ) clustering algorithm has been applied to compress the static and dynamic parameters before further processing in a radial basis neural network (RBFNN) platform. Results reveal an improvement in ESR accuracy of 86.05% by involving both static and dynamic prosodic features as compared to 84.92% accuracy when the combination of static prosodic feature simulated alone.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Palo, H.K., Mohanty, M.N.: Compartive analysis of neural networks for speech emotion recognition. Int. J. Eng. Technol. 7(4), 111–126 (2018)
Google Scholar
Rao, K.S., Reddy, R., Maity, S., Koolagudi, S. G.: Characterization of emotions using the dynamics of prosodic features. In: Speech Prosody 2010-Fifth International Conference (2010)
Google Scholar
Mannepalli, K., Maloji, S., Sastry, P.N., Danthala, S., Mannepalli, D.P.: Text independent emotion recognition for Telugu speech by using prosodic features. Int. J. Eng. Technol. 7(4), 111–126; 7(2), 594–596 (2018)
Article Google Scholar
Cao, H., Verma, R., Nenkova, A.: Speaker-sensitive emotion recognition via ranking: studies on acted and spontaneous speech. Comput. Speech Lang. 29(1), 186–202 (2015)
Article Google Scholar
Palo, H.K., Mohanty, M.N.: Modified-VQ features for speech emotion recognition. J. Appl. Sci. 16(9), 406–418 (2016)
Article Google Scholar
Ramakrishnan, S.: Recognition of emotion from speech: a review. In: Speech Enhancement, Modeling and Recognition-Algorithms and Applications. InTech (2012)
Google Scholar
Mishra, A.N., Chandra, M., Biswas, A., Sharan, S.N.: Robust features for connected Hindi digits recognition. Int. J. Sign. Process. Image Process. Pattern Recogn. 4(2), 79–90 (2011)
Google Scholar
Kwon, O.W., Chan, K., Hao, J., Lee, T-W.: Emotion recognition by speech signals. In: Interspeech (2003)
Google Scholar
Palo, H.K., Chandra, M., Mohanty, M.N.: Recognition of human speech emotion using variants of Mel-Frequency cepstral coefficients. In: Advances in Systems, Control and Automation, pp. 491–498. Springer, Singapore (2018)
Google Scholar
Jackson, P., Haq, S.: Surrey audio-visual expressed emotion (SAVEE) database, pp. 398–423. University of Surrey, Guildford, UK (2014)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electronics and Communication Engineering, ITER, Siksha ‘O’ Anusandhan University, Bhubaneswar, Odisha, India
Hemanta Kumar Palo & Mihir N. Mohanty

Authors

Hemanta Kumar Palo
View author publications
You can also search for this author in PubMed Google Scholar
Mihir N. Mohanty
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mihir N. Mohanty .

Editor information

Editors and Affiliations

School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India
Pradeep Kumar Mallick
Faculty of Engineering, Aurel Vlaicu University of Arad, Arad, Romania
Valentina Emilia Balas
Department of Electrical and Electronics Engineering, Sikkim Manipal Institute of Technology, Sikkim Manipal University, Rangpo, India
Akash Kumar Bhoi
Division of Information and Communication, Baekseok University, Cheonan-si, Ch’ungch’ong-namdo, Korea (Republic of)
Gyoo-Soo Chae

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Palo, H.K., Mohanty, M.N. (2020). Analysis of Speech Emotions Using Dynamics of Prosodic Parameters. In: Mallick, P., Balas, V., Bhoi, A., Chae, GS. (eds) Cognitive Informatics and Soft Computing. Advances in Intelligent Systems and Computing, vol 1040. Springer, Singapore. https://doi.org/10.1007/978-981-15-1451-7_36

Download citation

DOI: https://doi.org/10.1007/978-981-15-1451-7_36
Published: 15 January 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1450-0
Online ISBN: 978-981-15-1451-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics