Skip to main content

Robust Detection and Localization of Human Action in Video

  • Conference paper
Advances in Multimedia Modeling

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7733))

  • 1957 Accesses

Abstract

We propose a robust and efficient method for accurate detecting and localizing complex human action in video in space and time dimensions using spatio-temporal templates. A simple but effective motion descriptor based on the motion-compensated frame difference is designed for template representation, which is resistant to the deformation of posture and cluttered and moving background. A multi-step filtering scheme is adopted to speed up the target candidates localization and matching to the templates. For the template sequence to video registration, we present an extended continuous dynamic programming technique which can compute the matching scores for multiple trajectories simultaneously. Extensive experimental results on different videos have demonstrated the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bobick, A., Davis, J.: The Representation and Recognition of Action Using Temporal Templates. IEEE Trans. on PAMI 23(3), 257–267 (2001)

    Article  Google Scholar 

  2. Yamato, J., et al.: Recognizing Human Action in Time-Sequential Images using Hidden Markov Model. In: CVPR (1992)

    Google Scholar 

  3. Efros, A.A., Berg, A.C., Mori, G., Malik, J.: Recognizing action at a distance. In: ICCV (2003)

    Google Scholar 

  4. Zhu, G.-Y., Xu, C.S., Gao, W., Huang, Q.: Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine. In: Huang, T.S., Sebe, N., Lew, M., Pavlović, V., Kölsch, M., Galata, A., Kisačanin, B. (eds.) HCI/ECCV 2006. LNCS, vol. 3979, pp. 89–98. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  5. Song, Y., Zheng, Y.-T., Tang, S., Zhou, X., Zhang, Y., Lin, S., Chua, T.-S.: Localized Multiple Kernel Learning for Realistic Human Action Recognition in Videos. IEEE Trans. Circuits Syst. Video Techn. 21(9), 1193–1202 (2011)

    Article  Google Scholar 

  6. Li, H., Tang, J., Wu, S., Zhang, Y., Lin, S.: Automatic Detection and Analysis of Player Action in Moving Background Sports Video Sequences. IEEE Trans. Circuits Syst. Video Techn. 20(3), 351–364 (2010)

    Article  Google Scholar 

  7. Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. ACM Multimedia (2007)

    Google Scholar 

  8. Sun, J., Wu, X., Yan, S., Cheong, L.F., Chua, T.-S., Li, J.: Hierarchical spatio-temporal context modeling for action recognition. In: CVPR (2009)

    Google Scholar 

  9. Shechtman, E., Irani, M.: Space-Time Behavior Based Correlation. In: CVPR (2005)

    Google Scholar 

  10. Jiang, H., Li, Z.N., Drew, M.S.: Detecting Human Action in Active Video. In: ICME (2006)

    Google Scholar 

  11. Viola, P., Jones, M.: Rapid Object Detection using a Boosted Cascade of Simple Features. In: CVPR (2001)

    Google Scholar 

  12. Yilmaz, A., Javed, O., Shah, M.: ACM Computing Surveys 38(4) (2006)

    Article  Google Scholar 

  13. Zhang, H., Guo, Y.: Facial Expression Recognition Using Continuous Dynamic Programming. In: ICCV Workshop on RATFGRT (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Li, H., Sun, F., Guan, Y. (2013). Robust Detection and Localization of Human Action in Video. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_25

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35728-2_25

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35727-5

  • Online ISBN: 978-3-642-35728-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics