Abstract
This paper is to prove that lip-movement is an available channel for information acquiring. The reasoning is given by describing two kinds of valid applications, which are constructed on lip movement information only. One is lip-reading, the other is lip-movement utterance recognition. The accuracy of the former system with speaker-dependent could achieve 68%, and of the latter achieves over 99.5% for test-independent (TI) and nearly 100% for test-dependent (TD) in experiments till now. From this conclusion, it could be easily got that lip-reading channel is an effective one and can be applied independently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Petajan, E.D.: Automatic Lipreading to Enhance Speech Recognition. Ph.D. thesis of University of Illinois at Urbana_Champain, vol. 1, p. 261 (1984)
Yao, H.: Research and implementation on some key problems of lip-reading recognition technology. Ph.D. thesis of Department of Computer Science and Technology, Harbin Institute of Technology, China (2003)
Matthews, I., Cootes, T.F., Bangham, J.A., Cox, J.A., Harvey, R.: Extraction of visual features for lipreading. IEEE Transactions on Pattern Analysis and Machine Intelligence 24(2), 198–213 (2002)
Hong, X., Yao, H., Xu, M.: BioModel Database and Its Material Segmentation for Lip-Reading Recognition on Sentence (in Chinese). Chinese Journal Computer Engineering and Application 41(3), 174–177 (2005)
Luettin, J.: Visual Speech and Speaker Recognition. Ph.D. thesis of Department of Computer Science University of Sheffield, UK, vol. 1, p. 14 (1997)
Nefian, A.V., Liang, L.H., Fu, T., Liu, X.X.: A Bayesian Approach to Audio-Visual Speaker Identification. In: Kittler, J., Nixon, M.S. (eds.) AVBPA 2003, pp. 761–769 (2003)
Nefian, A.V., Liang, L.H., Liu, X.X., Pi, X., Murphy, K.: Dynamic Bayesian networks for audio-visual speech recognition. EURASIP, Journal of Applied Signal Processing 2002(11), 1274–1288 (2002)
Neti, C., Potamianos, G., Luettin, J., Matthews, I., Vergyri, D., Sison, J., Mashari, A., Zhou, J.: Audio visual speech recognition. In: Final Workshop 2000 Report (2000)
Shan, S.: Study on Some Key Issuses in Face Recognition. Ph.D. thesis of Institute of Computing Technology, Chinese Academy of Sciences, China (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hong, X., Yao, H., Liu, Q., Chen, R. (2005). An Information Acquiring Channel —— Lip Movement. In: Tao, J., Tan, T., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2005. Lecture Notes in Computer Science, vol 3784. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11573548_30
Download citation
DOI: https://doi.org/10.1007/11573548_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29621-8
Online ISBN: 978-3-540-32273-3
eBook Packages: Computer ScienceComputer Science (R0)