An Information Acquiring Channel —— Lip Movement

Hong, Xiaopeng; Yao, Hongxun; Liu, Qinghui; Chen, Rong

doi:10.1007/11573548_30

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3784)

Included in the following conference series:

International Conference on Affective Computing and Intelligent Interaction

Abstract

This paper argues that lip movement is a usable channel for acquiring information. The argument rests on two working applications built on lip-movement information alone: lip-reading and lip-movement utterance recognition. In experiments to date, the speaker-dependent lip-reading system reaches an accuracy of 68%, while the utterance recognition system achieves over 99.5% in the test-independent (TI) setting and nearly 100% in the test-dependent (TD) setting. These results indicate that the lip-movement channel is effective and can be used on its own.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Harbin Institute of Technology, Harbin, 150001, China
Xiaopeng Hong, Hongxun Yao, Qinghui Liu & Rong Chen

Editor information

Editors and Affiliations

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences
Jianhua Tao
National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tieniu Tan
MIT Media Laboratory, 20 Ames Street, 02139, Cambridge, MA, USA
Rosalind W. Picard

About this paper

Cite this paper

Hong, X., Yao, H., Liu, Q., Chen, R. (2005). An Information Acquiring Channel —— Lip Movement. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_30

DOI: https://doi.org/10.1007/11573548_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29621-8
Online ISBN: 978-3-540-32273-3
eBook Packages: Computer Science, Computer Science (R0)