Emotion Recognition Using Voice Based on Emotion-Sensitive Frequency Ranges

Hyun, Kyung Hak; Kim, Eun Ho; Kwak, Yoon Keun

doi:10.1007/978-3-540-73424-6_25

Part of the book series: Studies in Computational Intelligence (SCI, volume 76)

To date, research on emotion recognition from speech has focused on detecting pitch, formant, or cepstrum values from the way speech varies with emotion. However, the values of emotional speech features vary not only with emotion but also with the speaker. Because each speaker has unique frequency characteristics, it is difficult to apply the same method to different speakers. In the present work we therefore take the personal characteristics of speech into account. To this end, we analyzed the frequency characteristics of a user's speech and chose the frequency ranges that are sensitive to variation in emotion. From these results, we designed a personal filter bank and used it to extract emotional speech features. This method achieved a recognition rate of about 90%, although the rate differs among individuals.
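The abstract does not say how the emotion-sensitive frequency ranges were chosen or how the personal filter bank was built, so the following is only an illustrative sketch of the general idea, not the authors' procedure. It assumes rectangular frequency bands, log band energy as the feature, and a simple ratio of between-emotion to within-emotion variance as the sensitivity score. All function names, band edges, and the synthetic test tones are hypothetical.

```python
import math
from statistics import mean, pvariance


def power_spectrum(frame):
    """Naive DFT power spectrum (bins 0..N/2) of a short real-valued frame."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        re = sum(x * math.cos(2 * math.pi * k * i / n) for i, x in enumerate(frame))
        im = -sum(x * math.sin(2 * math.pi * k * i / n) for i, x in enumerate(frame))
        spectrum.append((re * re + im * im) / n)
    return spectrum


def band_log_energies(frame, sample_rate, band_edges_hz):
    """Log energy of the frame inside each [low, high) band given in Hz."""
    n = len(frame)
    spec = power_spectrum(frame)
    energies = []
    for low, high in band_edges_hz:
        total = sum(p for k, p in enumerate(spec) if low <= k * sample_rate / n < high)
        energies.append(math.log(total + 1e-12))  # small floor avoids log(0)
    return energies


def band_sensitivity(features_by_emotion):
    """Assumed criterion: between-emotion variance / mean within-emotion variance.

    features_by_emotion maps an emotion label to a list of band-energy vectors,
    one vector per utterance of a single speaker.
    """
    n_bands = len(next(iter(features_by_emotion.values()))[0])
    scores = []
    for b in range(n_bands):
        per_emotion = [[vec[b] for vec in vecs] for vecs in features_by_emotion.values()]
        between = pvariance([mean(v) for v in per_emotion])
        within = mean(pvariance(v) for v in per_emotion)
        scores.append(between / (within + 1e-12))
    return scores


def personal_filter_bank(features_by_emotion, band_edges_hz, keep):
    """Keep the `keep` most emotion-sensitive bands for this speaker."""
    scores = band_sensitivity(features_by_emotion)
    ranked = sorted(range(len(scores)), key=lambda b: scores[b], reverse=True)
    return [band_edges_hz[b] for b in sorted(ranked[:keep])]


def synthetic_frame(amp_low, amp_high, n=64, sample_rate=8000):
    """Hypothetical test signal: a 250 Hz tone plus a 1000 Hz tone."""
    return [
        amp_low * math.sin(2 * math.pi * 250 * i / sample_rate)
        + amp_high * math.sin(2 * math.pi * 1000 * i / sample_rate)
        for i in range(n)
    ]


# Hypothetical candidate bands and toy "utterances" for one speaker.
BANDS = [(0, 500), (500, 1500), (1500, 4000)]
frames = {
    "neutral": [synthetic_frame(1.0, 0.2), synthetic_frame(1.1, 0.25)],
    "angry": [synthetic_frame(1.0, 1.0), synthetic_frame(1.1, 0.9)],
}
features = {
    label: [band_log_energies(f, 8000, BANDS) for f in fs]
    for label, fs in frames.items()
}
selected = personal_filter_bank(features, BANDS, keep=1)
```

In this toy data only the 1000 Hz component changes with the emotion label, so the middle band scores highest and is the one retained. With real speech the candidate bands, the frame handling, and the selection criterion would all have to be chosen and validated per speaker.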

Author information

Authors and Affiliations

Department of Mechanical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea
Kyung Hak Hyun, Eun Ho Kim & Yoon Keun Kwak

Editor information

Editors and Affiliations

Institute of Information Sciences and Technology, Massey University (Turitea Campus), Palmerston North, New Zealand
Subhas Chandra Mukhopadhyay Ph.D
School of Electrical and Electronic Engineering, Singapore Polytechnic, Singapore
Gourab Sen Gupta

About this chapter

Cite this chapter

Hyun, K.H., Kim, E.H., Kwak, Y.K. (2007). Emotion Recognition Using Voice Based on Emotion-Sensitive Frequency Ranges. In: Mukhopadhyay, S.C., Gupta, G.S. (eds) Autonomous Robots and Agents. Studies in Computational Intelligence, vol 76. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73424-6_25

DOI: https://doi.org/10.1007/978-3-540-73424-6_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73423-9
Online ISBN: 978-3-540-73424-6
eBook Packages: Engineering, Engineering (R0)
