Visual Recognition of Activities, Gestures, Facial Expressions and Speech: An Introduction and a Perspective

Shah, Mubarak; Jain, Ramesh

doi:10.1007/978-94-015-8935-2_1

Part of the book series: Computational Imaging and Vision (CIVI, volume 9)

Abstract

Computer vision has started migrating from the periphery to the core of computer science and engineering. Multimedia computing and natural human-machine interfaces provide ample challenges and motivation to develop techniques that will play a key role in the next generation of computing systems. Recognition of objects and events is very important in multimedia systems as well as in interfaces. We consider an object a spatial entity and an event a temporal entity. Visual recognition of objects and activities is one of the fastest-developing areas of computer vision.

Author information

Authors and Affiliations

Computer Vision Lab, Computer Science Department, University of Central Florida, Orlando, FL, 32816, USA
Mubarak Shah
Electrical and Computer Engineering, University of California, San Diego, La Jolla, CA, 92093-0407, USA
Ramesh Jain

Editor information

Editors and Affiliations

Computer Vision Laboratory, Computer Science Department, University of Central Florida, 32816, Orlando, Florida, USA
Mubarak Shah
Electrical and Computer Engineering, University of California, San Diego, 92137, San Diego, California, USA
Ramesh Jain

About this chapter

Cite this chapter

Shah, M., Jain, R. (1997). Visual Recognition of Activities, Gestures, Facial Expressions and Speech: An Introduction and a Perspective. In: Shah, M., Jain, R. (eds) Motion-Based Recognition. Computational Imaging and Vision, vol 9. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-8935-2_1

DOI: https://doi.org/10.1007/978-94-015-8935-2_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-4870-7
Online ISBN: 978-94-015-8935-2
eBook Packages: Springer Book Archive
