Abstract
Recognizing emotions in elderly and disabled people is an essential part of determining whether they need help. This paper presents a study of audio-visual emotion recognition based on hidden Markov models (HMMs). In audio-visual emotion recognition, feature extraction and HMM training are key issues. Emotion features are extracted from speech and facial image sequences, and the HTK toolkit is used to train HMMs for audio, visual, and audio-visual multi-stream emotion recognition. In general, the recognition rates of the audio-visual multi-stream HMMs are slightly higher than those of the audio-only and visual-only HMMs, and the recognition rates of negative emotions differ slightly from those of positive emotions.
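To make the multi-stream idea concrete, the following is a minimal sketch, not the authors' HTK setup. It assumes one HMM per emotion, discrete (vector-quantized) audio and visual symbols instead of the continuous features a toolkit such as HTK would normally model, and a state-synchronous combination in which each state's emission score is the weighted sum of the per-stream log-probabilities. All names (forward_loglik, classify, TOY_MODELS, w_audio, w_visual) and all probability values are hypothetical and chosen only for illustration.

```python
import math

NEG_INF = float("-inf")


def weighted_log(p, weight):
    """Return weight * log(p); a zero weight means the stream is ignored."""
    if weight == 0:
        return 0.0
    return weight * math.log(p) if p > 0 else NEG_INF


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of log-values."""
    m = max(values)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(v - m) for v in values))


def forward_loglik(model, frames, w_audio=1.0, w_visual=1.0):
    """Forward algorithm in the log domain for a two-stream discrete HMM.

    frames is a non-empty list of (audio_symbol, visual_symbol) pairs.
    With weights other than 1 the result is a score, not a true likelihood.
    """
    pi, A = model["pi"], model["A"]
    n = len(pi)

    def log_b(j, frame):
        # Multi-stream emission: weighted sum of per-stream log-probabilities.
        a_sym, v_sym = frame
        return (weighted_log(model["B_audio"][j][a_sym], w_audio)
                + weighted_log(model["B_visual"][j][v_sym], w_visual))

    alpha = [weighted_log(pi[j], 1.0) + log_b(j, frames[0]) for j in range(n)]
    for frame in frames[1:]:
        alpha = [
            logsumexp([alpha[i] + weighted_log(A[i][j], 1.0) for i in range(n)])
            + log_b(j, frame)
            for j in range(n)
        ]
    return logsumexp(alpha)


def classify(models, frames, w_audio=1.0, w_visual=1.0):
    """Pick the emotion whose HMM gives the highest score for the sequence."""
    scores = {
        label: forward_loglik(m, frames, w_audio, w_visual)
        for label, m in models.items()
    }
    return max(scores, key=scores.get)


# Hypothetical toy parameters: two states, two symbols per stream.
_PI = [0.6, 0.4]
_A = [[0.7, 0.3], [0.4, 0.6]]
TOY_MODELS = {
    "anger": {
        "pi": _PI, "A": _A,
        "B_audio": [[0.9, 0.1], [0.8, 0.2]],
        "B_visual": [[0.8, 0.2], [0.7, 0.3]],
    },
    "happiness": {
        "pi": _PI, "A": _A,
        "B_audio": [[0.1, 0.9], [0.2, 0.8]],
        "B_visual": [[0.2, 0.8], [0.3, 0.7]],
    },
}
```

Setting w_visual=0.0 or w_audio=0.0 reduces the sketch to an audio-only or visual-only recognizer, which is one simple way to compare the three configurations on the same sequences.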
Copyright information
© 2012 Springer-Verlag GmbH Berlin Heidelberg
About this chapter
Cite this chapter
Zhao, J., Wu, X., Jiang, D. (2012). Audio-Visual Emotion Recognition Based on Hidden Markov Model. In: Qian, Z., Cao, L., Su, W., Wang, T., Yang, H. (eds) Recent Advances in Computer Science and Information Engineering. Lecture Notes in Electrical Engineering, vol 129. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25778-0_14
DOI: https://doi.org/10.1007/978-3-642-25778-0_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25777-3
Online ISBN: 978-3-642-25778-0
eBook Packages: Engineering, Engineering (R0)