Emotion Recognition from Speech Using Perceptual Features and Convolutional Neural Networks

Revathi, A.; Nagakrishnan, R.; Vishnu Vashista, D.; Teja, Kuppa Sai Sri; Sasikaladevi, N.

doi:10.1007/978-981-15-3992-3_29

A. Revathi³⁸,
R. Nagakrishnan³⁸,
D. Vishnu Vashista³⁸,
Kuppa Sai Sri Teja³⁸ &
…
N. Sasikaladevi³⁹

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 656))

697 Accesses
2 Citations

Abstract

Emotional computing has played a crucial role in acting as an interface between humans and machines. Speech based emotion recognition system is difficult to be implemented because of the dataset which is containing a limited number of speeches. In this work, multi speaker independent emotion recognition encompasses the use of perceptual features with filters spaced in BARK scale and Equivalent rectangular bandwidth (ERB) and vector quantization (VQ) for classifying groups and convolutional neural network with backpropagation algorithm for emotion classification in a group. The proposed system has provided consistently better accuracy for the perceptual feature with critical band analysis done in ERB scale with overall accuracy as 86% and decision level fusion classification yielded 100% accuracy for all emotions. Speaker dependent emotion recognition system has provided 100% as accuracy for all the emotions for ERB-PLPC features and perceptual linear predictive cepstrum has given 100% as accuracy for all emotions except sad emotion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lee C-C, Mower E, Busso C, Lee S, Narayanan S (2011) Emotion recognition using a hierarchical binary decision tree approach. Speech Commun 53:1162–1171
Article Google Scholar
Wua S, Falk TH, Chan W-Y (2011) Automatic speech emotion recognition using modulation spectral features. Speech Commun 53:768–785
Article Google Scholar
Sreenivasa Rao K, Kumar TP, Anusha K, Leela B, Bhavana I, Gowtham SVSK (2012) Emotion recognition from speech. Int J Comput Sci Inf Technol 3(2):3603–3607
Google Scholar
Sapra A, Panwar N, Panwar S (2013) Emotion recognition from speech. Int J Emerg Technol Adv Eng 3(2). ISSN 2250–2459, ISO 9001:2008 Certified Journal
Google Scholar
Koolagudi SG, Sharma K, Sreenivasa Rao K (2012) Speaker recognition in emotional environment. Commun Comput Inf Sci 305:117–124
Google Scholar
Nithya Roopa S, Prabhakaran M, Betty P (2018) Speech emotion recognition using deep learning. Int J Recent Technol Eng (IJRTE) 7(4S):247–250
Google Scholar
Huang K, Wu C, Hong Q, Su M, Chen Y (2019) Speech emotion recognition using deep neural network considering verbal and nonverbal speech sounds. In: ICASSP 2019–2019 IEEE international conference on acoustics, speech and signal processing (ICASSP), Brighton, United Kingdom, pp 5866–5870
Google Scholar
Revathi A, Venkataramani Y (2012) Perceptual features based continuous speech recognition in additive noise environment using various modeling techniques. STM J Curr Trends Sig Process 2(3):1–15
Google Scholar
Jeyalakshmi J, Revathi A, Venkataramani Y (2016) Integrated models and features based speaker independent emotion recognition. Int J Tele Med Clin Pract 1(3):271–291
Google Scholar
Huang J, Li J, Gong Y (2015) An analysis of convolutional neural networks for speech recognition. In: 2015 IEEE international conference on acoustics, speech and signal processing (ICASSP), Brisbane, QLD, pp 4989–4993
Google Scholar
Palaz D, Magimai Doss M, Collobert R (2015) Convolutional neural networks-based continuous speech recognition using raw speech signal. In: 2015 IEEE international conference on acoustics, speech and signal processing (ICASSP), Brisbane, QLD, pp 4295–4299
Google Scholar
Abdel-Hamid O, Mohamed A, Jiang H, Deng L, Penn G, Yu D (2014) Convolutional neural networks for speech recognition. IEEE/ACM Trans Audio Speech Lang Process 22(10):1533–1545
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of ECE, School of EEE, SASTRA Deemed to be University, Thanjavur, Tamil Nadu, India
A. Revathi, R. Nagakrishnan, D. Vishnu Vashista & Kuppa Sai Sri Teja
Department of CSE, School of Computing, SASTRA Deemed to be University, Thanjavur, Tamil Nadu, India
N. Sasikaladevi

Authors

A. Revathi
View author publications
You can also search for this author in PubMed Google Scholar
R. Nagakrishnan
View author publications
You can also search for this author in PubMed Google Scholar
D. Vishnu Vashista
View author publications
You can also search for this author in PubMed Google Scholar
Kuppa Sai Sri Teja
View author publications
You can also search for this author in PubMed Google Scholar
N. Sasikaladevi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kuppa Sai Sri Teja .

Editor information

Editors and Affiliations

Department of Electronics and Communication Engineering, Mar Baselios College of Engineering and Technology, Thiruvananthapuram, Kerala, India
J. Jayakumari
Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece
George K. Karagiannidis
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore, Singapore
Maode Ma
Department of Computer Science and Engineering, Daffodil International University, Dhaka, Bangladesh
Syed Akhter Hossain

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Revathi, A., Nagakrishnan, R., Vishnu Vashista, D., Teja, K.S.S., Sasikaladevi, N. (2020). Emotion Recognition from Speech Using Perceptual Features and Convolutional Neural Networks. In: Jayakumari, J., Karagiannidis, G., Ma, M., Hossain, S. (eds) Advances in Communication Systems and Networks . Lecture Notes in Electrical Engineering, vol 656. Springer, Singapore. https://doi.org/10.1007/978-981-15-3992-3_29

Download citation

DOI: https://doi.org/10.1007/978-981-15-3992-3_29
Published: 14 June 2020
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-3991-6
Online ISBN: 978-981-15-3992-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics