Speech Emotional Recognition Using Global and Time Sequence Structure Features with MMD

  • Conference paper
Affective Computing and Intelligent Interaction (ACII 2005)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3784))

Abstract

In this paper, combined global and time-sequence structure features are used as the characteristic parameters for speech emotion recognition. A new method based on the MMD (Modified Mahalanobis Distance) formula is proposed to reduce estimation errors and simplify the calculation. Four emotions are considered: happiness, anger, surprise and sadness. 1000 sentences for recognition, collected from 10 speakers, were used to demonstrate the effectiveness of the new method. The average emotion recognition rate reached 95%. Data analysis also showed that MMD performs better than the MQDF (Modified Quadratic Discriminant Function) method [1].
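The abstract does not state the modified formula itself, so the sketch below illustrates only the standard Mahalanobis distance on which MMD is based, used as a minimum-distance classifier. It is not the authors' method. It assumes a diagonal covariance (per-feature variances) to stay short. The function names, the `eps` term and the toy feature values are all hypothetical.

```python
from statistics import fmean, pvariance

def fit_class_stats(samples_by_class):
    """Estimate per-feature mean and variance for each emotion class."""
    stats = {}
    for label, samples in samples_by_class.items():
        columns = list(zip(*samples))  # one tuple per feature dimension
        means = [fmean(col) for col in columns]
        variances = [pvariance(col) for col in columns]
        stats[label] = (means, variances)
    return stats

def mahalanobis_sq(x, means, variances, eps=1e-9):
    """Squared Mahalanobis distance under a diagonal-covariance assumption.

    eps is a hypothetical guard against zero variance; it is not the
    modification proposed in the paper.
    """
    return sum((xi - m) ** 2 / (v + eps) for xi, m, v in zip(x, means, variances))

def classify(x, stats):
    """Assign x to the class whose distribution is nearest."""
    return min(stats, key=lambda label: mahalanobis_sq(x, *stats[label]))

# Made-up two-dimensional feature vectors, for illustration only.
toy = {
    "happiness": [(1.0, 2.0), (1.2, 1.8), (0.8, 2.2)],
    "sadness": [(-1.0, -2.0), (-1.2, -1.8), (-0.8, -2.2)],
}
stats = fit_class_stats(toy)
predicted = classify((0.9, 2.1), stats)
```

With a full covariance matrix, the sum in `mahalanobis_sq` becomes the quadratic form (x - mean)^T Sigma^-1 (x - mean). Estimating and inverting Sigma from limited training data is a common source of estimation error in classifiers of this kind.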

References

  1. Cai, L., Jiang, C., Wang, Z., Zhao, L., Zou, C.: A Method Combining the Global and Time Series Structure Features for Emotion Recognition in Speech. In: IEEE Int. Conf. Neural Networks & Signal Processing (2003)

  2. Iida, A., Campbell, N., Iga, S., Higuchi, F., Yasumura, M.: Acoustic Nature and Perceptual Testing of Corpora of Emotional Speech

  3. Banse, R., Scherer, K.R.: Acoustic Profiles in Vocal Emotion Expression. Journal of Personality and Social Psychology 70(3) (1996)

  4. Mozziconacci, S.: Speech Variability and Emotion: Production and Perception. Technische Universiteit Eindhoven, Eindhoven (1998)

  5. Scherer, K.R.: Speech and Emotional States. In: Darby, J.K. (ed.) Speech Evaluation in Psychiatry. Grune and Stratton, New York (1981)

  6. Soskin, W.F., Kauffman, P.E.: Judgements of Emotions in Word-free Voice Samples. Journal of Communication (1961)

  7. Li, Z., Xiangmin, Q., Cairong, Z., Zhenyang, W.: A Study on Emotional Recognition in Speech Signal. Journal of Software 12(7) (2001)

  8. Cowie, R.: Emotion Recognition in Human-Computer Interaction. IEEE Signal Processing Magazine 18(1), 32–80 (2001)

  9. Muraka, S.: Emotional Constituents in Text and Emotional Components in Speech. Ph.D. Thesis, Kyoto Institute of Technology, Kyoto, Japan (1998)

  10. Shigenaga, M.: Features of Emotionally Uttered Speech Revealed by Discriminant Analysis (VI). The Preprint of the Acoustical Society of Japan, pp. 2–18 (1999)

  11. Li, Z., Xiangmin, Q., Cairong, Z., Zhenyang, W.: A Study on Emotional Feature Analysis and Recognition in Speech Signal. Journal of China Institute of Communications 21(1), 18–25 (2000)

  12. Li, Z., Xiangmin, Q., Cairong, Z., Zhenyang, W.: A Study on Emotional Feature Extract in Speech Signal. Data Collection and Process 15(1), 120–123 (2000)

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhao, L., Cao, Y., Wang, Z., Zou, C. (2005). Speech Emotional Recognition Using Global and Time Sequence Structure Features with MMD. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_40

  • DOI: https://doi.org/10.1007/11573548_40

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-29621-8

  • Online ISBN: 978-3-540-32273-3

  • eBook Packages: Computer Science (R0)
