Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
ISO/IEC 14496-1 IS (MPEG-4). Information Technology — Coding of audio-visual objects, Part 1: Systems. www.iso.org
ISO/IEC 14496-2 IS (MPEG-4). Information Technology — Coding of audio-visual objects, Part 2: Visual. www.iso.org
H P Graf et al. Multimodal system for locating heads and faces. Proc Int Conf on Automatic Face and Gesture Recognition, 1996
G Hovden and N Ling. Optimizing facial animation parameters for MPEG-4. IEEE Trans on Consumer Electronics, pp 1354–1359, 2003.
P Ekman et al. Emotions Inside Out. Annals of the New York Academy of Sciences, 2003.
P Ekman et al. (Editors). What the Face Reveals: Basic and Applied Studies of Spontaneous Expression Using the Facial Action Coding System (FACS). Oxford University Press, 1997.
M S Bartlett. Face Image Analysis by Unsupervised Learning. Kluwer, 2001.
E D Petajan. Automatic lipreading to enhance speech recognition. PhD Thesis, University of Illinois at Urbana-Champaign, 1984.
E D Petajan. Automatic lipreading to enhance speech recognition. Proc Globecom Telecommunications Conference, 1984.
C Bregler et al. A hybrid approach to bimodal speech recognition. Proc Annual Asilomar Conf on Signals, Systems, and Computers, 1994.
P L Silsbee and A C Bovik. Medium vocabulary audiovisual speech recognition. In: A J Rubio Ayuso and J M Lopez Soler (Editors). New Advances and Trends in Speech Recognition and Coding. Springer-Verlag, 1995.
A Adjoudani and C Benoit. Audio-visual speech recognition compared across two architectures. Proc Eurospeech, 1995
J Luettin et al. Active shape models for visual speech feature extraction. In: D G Stork and M Hennecke (Editors). Speechreading by Humans and Machines: Models, Systems, and Applications. Springer-Verlag, 1996.
A J Goldschen et al. Continuous optical automatic speech recognition, Proc Asilomar Conf on Signals, Systems, and Computers, 1994.
J F Baldwin et al. Automatic computer lip-reading using fuzzy set theory. Proc AVSP, 1999.
J R Movellan and P Mineiro. A diffusion network approach to visual speech recognition. Proc AVSP, 1999.
P Niyogi et al. Feature based representation for audio-visual speech recognition. Proc AVSP, 1999.
B Talle and A Wichert. Audio-visual sensor fusion with neural architectures. Proc AVSP, 1999.
A Senior et al., On the use of visual information for improving audio-based speaker recognition. Proc AVSP, 1999.
G Fries et al. A tool for designing MPEG-4 compliant expressions and animations on VRML cartoon faces. Proc AVSP, 1999.
M Escher et al. User interactive MPEG-4 compatible facial animation system. Proc IWSNHC3DI, 1999.
N Grammalidis et al. Three-dimensional facial adaptation for MPEG-4 talking heads. J Applied Signal Proc, pp 1005–1020, 2002.
A Raouzaiou et al. Parameterized facial expression synthesis based on MPEG-4. J Applied Signal Proc, pp 1021–1038, 2002.
C Tolga et al. Efficient modeling of virtual humans in MPEG-4. Proc ICME, 2000.
I Pandzic and R Forchheimer. MPEG-4 Facial Animation: The Standard, Implementation and Applications. John Wiley and Sons, 2002.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Petajan, E. (2005). MPEG-4 Face and Body Animation Coding Applied to HCI. In: Kisačanin, B., Pavlović, V., Huang, T.S. (eds) Real-Time Vision for Human-Computer Interaction. Springer, Boston, MA. https://doi.org/10.1007/0-387-27890-7_15
Download citation
DOI: https://doi.org/10.1007/0-387-27890-7_15
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-27697-7
Online ISBN: 978-0-387-27890-2
eBook Packages: Computer ScienceComputer Science (R0)