Skip to main content

Visual Recognition of Activities, Gestures, Facial Expressions and Speech: An Introduction and a Perspective

  • Chapter
Motion-Based Recognition

Part of the book series: Computational Imaging and Vision ((CIVI,volume 9))

Abstract

Computer vision has started migrating from the peripheral area to the core of computer science and engineering. Multimedia computing and natural human-machine interfaces are providing adequate challenges and motivation to develop techniques that will play key role in the next generation of computing systems. Recognition of objects and events is very important in multimedia systems as well as interfaces. We consider an object a spatial entity and an event a temporal entity. Visual recognition of objects and activities is one of the fastest developing area of computer vision.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Adelson, E.H. and Niyogi, S. A. Analyzing and recognizing walking figures in XYT. In IEEE CVPR-94, pages 469–474, 1994.

    Google Scholar 

  2. A. Katkere, S. Moezzi, D. Kuramura, P. Kelly, and R. Jain. Towards video-based immersive environments. Multimedia Systems Journal, Spring 1997.

    Google Scholar 

  3. M. C. Allmen. Image Sequence Description Using Spatiotemporal Flow Curves: Toward Motion-Based Recognition. PhD thesis, University of Wisconsin-Madison, 1991.

    Google Scholar 

  4. M. C. Allmen and C. R. Dyer. Cyclic Motion Detection Using Spatiotemporal Surfaces and Curves. In Proc. 10th Int. Conf. Pattern Recognition, pages 365–370, 1990.

    Chapter  Google Scholar 

  5. Bobick, A. F. and Campbell, L.W. Recognition of human body motion using phase space constraints. In IEEE ICCV-95, pages 624–630, 1995.

    Google Scholar 

  6. Cédras, C., and Shah, M. Motion-based recognition: A survey. Image and Vision Computing, 13 (2): 129–155, March 1995.

    Google Scholar 

  7. Cipolla, R.,Okamoto, Y. and Kuno, Y. Robust structure from motion using motion parallax. In IEEE Proceedings on International Conference on Computer Vision, 1993.

    Google Scholar 

  8. Cootes, T. J., taylor, C.J., and Graham, J. Active shape models-their training and application. Computer Vision, and Image Understanding, 61: 38–59, 1995.

    Article  Google Scholar 

  9. Darrell, T., and Pentland, A. Space-time gestures. In CVPR, pages 335–340. IEEE, 1993.

    Google Scholar 

  10. Davis, J., and Shah, M. Three-dimensional gesture recognition. In Asilomar Conference on Signals, Systems, And Computers, 1994.

    Google Scholar 

  11. Davis, J., and Shah, M. Visual gesture recognition. IEE Proceedings Vision, Image and Signal Processing, 141 (2): 101–106, 1994.

    Article  Google Scholar 

  12. Davis, L. S. and Gavrila, D. M. 3-d model-based tracking of human upper body movement: A multi-view approach. In IEEE CVPR-96, pages 73–80, 1996.

    Google Scholar 

  13. Ekman, P. and Friesen, W. A. Facial Action Coding System. Consulting Psychologist Press, 1978.

    Google Scholar 

  14. Fukumoto, M., Mase, K., and Suenaga, Y. Real-time detection of pointing actions for a glove-free interface. In IA PR Workshop on Machine Vision Applications, pages 473–476, December 1992.

    Google Scholar 

  15. D. C. Hogg. Interpreting Images of a Known Moving Object. PhD thesis, University of Sussex, 1984.

    Google Scholar 

  16. Huang, T., Pavlovic, V. Hand gesture modeling, analysis, and synthesis. In Proc. International Workshop on Automatic Face and Gesture Recognition, pages 73–79, 1995.

    Google Scholar 

  17. J. Schlenzig, E. Hunter and R. Jain. Recursive identification of gesture inputs using hidden markov models. In Proc. IEEE Workshop on Applications of Computer Vision, pages 187–194, 1994.

    Chapter  Google Scholar 

  18. Jain, R.C., Militzer, D., and H.-H. Separating non-stationary from stationary scene components in a sequence of real world tv-images. In IJCAI-77, pages 612–618, 1977.

    Google Scholar 

  19. Kang, S.B., and Ikeuchi, K. Toward automatic robot instruction from perception–recognizing a grasp from observation. IEEE Transactions of Robotics and Automation, 9: 432–443, August 1993.

    Article  Google Scholar 

  20. Kass, M., Witkin, A., and Terzopoulos, D. Snakes: Active contour models. In Proceedings of First International Conference on Computer Vision, pages 259–269, London, 1987.

    Google Scholar 

  21. Little, J. and Boyd, J. Describing motion for recognition. Int. Symposium on Computer Vision-95, pages 235–240, November 1995.

    Google Scholar 

  22. Metaxas, D. and Kakadiaris, I.A. Model-based estimation of 3d human motion with occlusion based on active multi-viewpoint selection. In IEEE CVPR-96, pages 81–87, 1996.

    Google Scholar 

  23. Petajan, E. Automatic Liprading to Enhance Speech Recognition. PhD thesis, University of Illinois, 1984.

    Google Scholar 

  24. L.R. Rabiner and B.H. Juang. An Introduction to Hidden Markov Models. IEEE ASSP Magazine, pages 4–16, January 1986.

    Google Scholar 

  25. Rangarajan, K., Allen, Bill, and Shah, M. Matching motion trajectories. Pattern Recognition, 26:595–610, July, 1993.

    Google Scholar 

  26. Rehg, J., and Kanade, T. Visual tracking of high dof articulated structures: an application to human hand tracking. In ECCV, pages 35–46, May 1994.

    Google Scholar 

  27. R. Jain and K. Wakimoto. Multiple perspective interactive video. In Proceedings of the International Conference on Multimedia Computing and Systems, pages 202211. Computer Society Press, May 15–18 1995.

    Google Scholar 

  28. Stork, D. and Hennecke, M. Speechreading by humans and machines. Springer, 1996.

    Google Scholar 

  29. Baudel T. and Beaudouin-Lafon M. Charade: Remote control of objects using freehand gestures. CA CM, pages 28–35, July 1993.

    Google Scholar 

  30. Tsai, Ping-Sing, Keiter, K., Kasparis, T., and Shah, M. Cyclic motion detection. Pattern Recognition, 27 (12), 1994.

    Google Scholar 

  31. Turk, M., and Pentland, A. Eigenfaces for recognition. Journal of Cognitive Neuroscience, pages 71–86, 1991.

    Google Scholar 

  32. Williams, D. and Shah, M. Greedy algorithm for active contour and curvature estimation. Computer Vision, Graphics, and Image Processing, pages 14–26, January, 1992.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Shah, M., Jain, R. (1997). Visual Recognition of Activities, Gestures, Facial Expressions and Speech: An Introduction and a Perspective. In: Shah, M., Jain, R. (eds) Motion-Based Recognition. Computational Imaging and Vision, vol 9. Springer, Dordrecht. https://doi.org/10.1007/978-94-015-8935-2_1

Download citation

  • DOI: https://doi.org/10.1007/978-94-015-8935-2_1

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-4870-7

  • Online ISBN: 978-94-015-8935-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics