Skip to main content

Audio-Visual Emotion Recognition Based on Hidden Markov Model

  • Chapter
  • First Online:
Recent Advances in Computer Science and Information Engineering

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 129))

  • 208 Accesses

Abstract

Recognizing emotions in the elderly and disabled people is an essential part in knowing whether they need help. In this paper, a study is presented for audio visual emotion recognition based on Hidden Markov Model (HMM). In the realm of audio visual emotion recognition, feature extraction of audio visual emotion and HMM training are very important issues. Emotion features of speech and facial image sequences are extracted andthe HTK toolkit is adopted to train the hidden Markov models for audio, visual and audio visualmulti-stream emotion recognition. In general, the recognition rates of audio-visual multi-stream HMMs are slightly higher than the audio only HMM and visual only HMM, and the recognition rates of negative emotions areslightly than positive emotions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://info.china.alibaba.com/news/detail/v5000441-d1004571420.html (2009)

  2. Chennoukh, S., Gerrits, A., Miet, G., Sluijter, R.: Speech enhancement via frequency bandwidth extension using line spectral frequencies. In: Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2001), vol. 1, pp. 665–668. IEEE (2001)

    Google Scholar 

  3. Lei, X., Dongmei, J., Ravyse, I., Rongchun, Z., Sahli, H., Verhelst, W., Cornelis, J.: Experimental research on audio visual fusion and on model asynchrony for raising speech recognition rate. Journal of Northwestern Polytechnical University 2 (2004)

    Google Scholar 

  4. Martin, O., Kotsia, I., Macq, B., Pitas, I.: The enterface’05 audio-visual emotion database. In: Proceedings of the 22nd International Conference on Data Engineering Workshops, p. 8. IEEE Computer Society (2006)

    Google Scholar 

  5. Paleari, M., Benmokhtar, R., Huet, B.: Evidence theory-based multimodal emotion recognition. Advances in Multimedia Modeling, 435–446 (2009)

    Google Scholar 

  6. Petrushin, V.: Emotion recognition in speech signal: experimental study, development, and application. In: Sixth International Conference on Spoken Language Processing (2000)

    Google Scholar 

  7. Lin, Y., Wei, G., Yang, K.: A research of speech emotion recognition. Journal of Circuits and Systems 12(1), 90–98 (2007)

    Google Scholar 

  8. Schuller, B., Rigoll, G., Lang, M.: Hidden markov model-based speech emotion recognition. In: Proceedings of the 2003 International Conference on Multimedia and Expo., vol. 2, pp. 401–404. IEEE Computer Society (2003)

    Google Scholar 

  9. Sheng, W.: Audio emotion recognition based on hidden markov model. Heilongjiang Science and Technology Information (028), 2–2 (2010) (in Chinese)

    Google Scholar 

  10. Young, S., Evermann, G., Kershaw, D., Moore, G., Odell, J., Ollason, D., Valtchev, V., Woodland, P.: The HTK book, Cambridge, vol. 2 (1999)

    Google Scholar 

  11. Zeng, Z., Pantic, M., Roisman, G., Huang, T.: A survey of affect recognition methods: Audio, visual, and spontaneous expressions. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(1), 39–58 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag GmbH Berlin Heidelberg

About this chapter

Cite this chapter

Zhao, J., Wu, X., Jiang, D. (2012). Audio-Visual Emotion Recognition Based on Hidden Markov Model. In: Qian, Z., Cao, L., Su, W., Wang, T., Yang, H. (eds) Recent Advances in Computer Science and Information Engineering. Lecture Notes in Electrical Engineering, vol 129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25778-0_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-25778-0_14

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25777-3

  • Online ISBN: 978-3-642-25778-0

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics