Robust Emotion Recognition using Speaking Rate Features

Rao, K. Sreenivasa; Koolagudi, Shashidhar G.

doi:10.1007/978-1-4614-6360-3_5

K. Sreenivasa Rao³ &
Shashidhar G. Koolagudi³

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

948 Accesses
3 Citations

Abstract

In this chapter speaking rate characteristics of speech are explored for discriminating the emotions. In real life, we observe that certain emotions are very active with high speaking rate and some are passive with low speaking rate. With this motivation, in this chapter, we have proposed a two stage emotion recognition system, where the emotions are classified into three broad groups (active, neutral and passive) at the first stage and during second stage emotions in each broad group are further classified. Spectral and prosodic features are explored in each stage for discriminating the emotions. Combination of spectral and prosodic features is observed to be performed better.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

S.G. Koolagudi, K.S. Rao, Two stage emotion recognition based on speaking rate. Int. J. Speech Technol. 14, 35–48 (2011)
Google Scholar
S.G. Koolagudi, S. Ray, K.S. Rao, Emotion classification based on speaking rate, in Communications in Computer and Information Science, ed. by S. Ranka, A. Banerjee, K.K. Biswas, S. Dua, P. Mishra, R. Moona, S.-H. Poon, C.-L. Wang. International Conference on Contemporary Computing, vol. 94, pp. 316–327, Springer, USA, 6–8 Aug 2010
Google Scholar
K.S. Rao, B. Yegnanarayana, Modeling durations of syllables using neural networks. Comput. Speech Lang. 21, 282–295 (2007)
Google Scholar
A.L. Francis, H.C. Nusbaum, Paying attention to speaking rate, in Fourth International Conference on Spoken Language, 1996 ICSLP 96, (Philadelphia, PA, USA), pp. 1537–1540 (V3), IEEE, October 1996. Center for Computational Psychology, Department of Psychology, The University of Chicago
Google Scholar
J. Yuan, M. Liberman, C. Cieri, Towards an integrated understanding of speaking rate in conversation, in Interspeech 2006, (Pittsburgh, PA, 2006), pp. 541–544
Google Scholar
M.S.H. Reddy, K.S. Kumar, S. Guruprasad, B. Yegnanarayana, Subsegmental features for analysis of speech at different speaking rates, in International Conference on Natural Language Processing, (Macmillan, India, 2009), pp. 75–80
Google Scholar
A. LI, Y. ZU, Speaking rate effects on discourse prosody in standard chinese, in Fourth International Conference on Speech Prosody, (Campinas, Brazil, 2008), pp. 449–452, 6–9 May 2008
Google Scholar
H. Yang, W. Guo, Q. Liang, A speaking rate adjustable digital speech repeater for listening comprehension in second-language learning, in International Conference on Computer Science and, Software Engineering, vol. 5, pp. 893–896, 12–14 Dec 2008
Google Scholar
S.G. Koolagudi, S. Maity, V.A. Kumar, S. Chakrabarti, K.S. Rao, IITKGP-SESC : speech database for emotion analysis. Communications in Computer and Information Science, JIIT University, Noida, India: Springer, ISSN: 1865–0929 ed., 17–19 Aug 2009
Google Scholar
E.F. Lussier, N. Morgan, Effects of speaking rate and word frequency on pronunciations in convertional speech. Speech Commun. 29, 137–158 (1999)
Article Google Scholar
M. Richardson, M.Y. Hwang, A. Acero, X. Huang, Improvements on speech recognition for fast talkers, in Eurospeech Conference, Sept 1999
Google Scholar

Download references

Author information

Authors and Affiliations

School of Information Technology, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, 721302, India
K. Sreenivasa Rao & Shashidhar G. Koolagudi

Authors

K. Sreenivasa Rao
View author publications
You can also search for this author in PubMed Google Scholar
Shashidhar G. Koolagudi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Sreenivasa Rao .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Rao, K.S., Koolagudi, S.G. (2013). Robust Emotion Recognition using Speaking Rate Features. In: Robust Emotion Recognition using Spectral and Prosodic Features. SpringerBriefs in Electrical and Computer Engineering(). Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6360-3_5

Download citation

DOI: https://doi.org/10.1007/978-1-4614-6360-3_5
Published: 13 January 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6359-7
Online ISBN: 978-1-4614-6360-3
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics