Abstract
Computer vision has started migrating from the periphery to the core of computer science and engineering. Multimedia computing and natural human-machine interfaces are providing adequate challenges and motivation to develop techniques that will play a key role in the next generation of computing systems. Recognition of objects and events is very important in multimedia systems as well as in interfaces. We consider an object a spatial entity and an event a temporal entity. Visual recognition of objects and activities is one of the fastest-developing areas of computer vision.
Copyright information
© 1997 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Shah, M., Jain, R. (1997). Visual Recognition of Activities, Gestures, Facial Expressions and Speech: An Introduction and a Perspective. In: Shah, M., Jain, R. (eds) Motion-Based Recognition. Computational Imaging and Vision, vol 9. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-8935-2_1
DOI: https://doi.org/10.1007/978-94-015-8935-2_1
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-4870-7
Online ISBN: 978-94-015-8935-2
eBook Packages: Springer Book Archive