Robust Detection and Localization of Human Action in Video

Li, Haojie; Sun, Fuming; Guan, Yue

doi:10.1007/978-3-642-35728-2_25

Haojie Li⁷,
Fuming Sun⁸ &
Yue Guan⁷

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7733))

1957 Accesses

Abstract

We propose a robust and efficient method for accurate detecting and localizing complex human action in video in space and time dimensions using spatio-temporal templates. A simple but effective motion descriptor based on the motion-compensated frame difference is designed for template representation, which is resistant to the deformation of posture and cluttered and moving background. A multi-step filtering scheme is adopted to speed up the target candidates localization and matching to the templates. For the template sequence to video registration, we present an extended continuous dynamic programming technique which can compute the matching scores for multiple trajectories simultaneously. Extensive experimental results on different videos have demonstrated the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Bobick, A., Davis, J.: The Representation and Recognition of Action Using Temporal Templates. IEEE Trans. on PAMI 23(3), 257–267 (2001)
Article Google Scholar
Yamato, J., et al.: Recognizing Human Action in Time-Sequential Images using Hidden Markov Model. In: CVPR (1992)
Google Scholar
Efros, A.A., Berg, A.C., Mori, G., Malik, J.: Recognizing action at a distance. In: ICCV (2003)
Google Scholar
Zhu, G.-Y., Xu, C.S., Gao, W., Huang, Q.: Action Recognition in Broadcast Tennis Video Using Optical Flow and Support Vector Machine. In: Huang, T.S., Sebe, N., Lew, M., Pavlović, V., Kölsch, M., Galata, A., Kisačanin, B. (eds.) HCI/ECCV 2006. LNCS, vol. 3979, pp. 89–98. Springer, Heidelberg (2006)
Chapter Google Scholar
Song, Y., Zheng, Y.-T., Tang, S., Zhou, X., Zhang, Y., Lin, S., Chua, T.-S.: Localized Multiple Kernel Learning for Realistic Human Action Recognition in Videos. IEEE Trans. Circuits Syst. Video Techn. 21(9), 1193–1202 (2011)
Article Google Scholar
Li, H., Tang, J., Wu, S., Zhang, Y., Lin, S.: Automatic Detection and Analysis of Player Action in Moving Background Sports Video Sequences. IEEE Trans. Circuits Syst. Video Techn. 20(3), 351–364 (2010)
Article Google Scholar
Scovanner, P., Ali, S., Shah, M.: A 3-dimensional sift descriptor and its application to action recognition. ACM Multimedia (2007)
Google Scholar
Sun, J., Wu, X., Yan, S., Cheong, L.F., Chua, T.-S., Li, J.: Hierarchical spatio-temporal context modeling for action recognition. In: CVPR (2009)
Google Scholar
Shechtman, E., Irani, M.: Space-Time Behavior Based Correlation. In: CVPR (2005)
Google Scholar
Jiang, H., Li, Z.N., Drew, M.S.: Detecting Human Action in Active Video. In: ICME (2006)
Google Scholar
Viola, P., Jones, M.: Rapid Object Detection using a Boosted Cascade of Simple Features. In: CVPR (2001)
Google Scholar
Yilmaz, A., Javed, O., Shah, M.: ACM Computing Surveys 38(4) (2006)
Article Google Scholar
Zhang, H., Guo, Y.: Facial Expression Recognition Using Continuous Dynamic Programming. In: ICCV Workshop on RATFGRT (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

School of Software, Dalian University of Technology, China
Haojie Li & Yue Guan
Liaoning University of Technology, China
Fuming Sun

Authors

Haojie Li
View author publications
You can also search for this author in PubMed Google Scholar
Fuming Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yue Guan
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Asia, 5 Danling Street, 100080, Beijing, China
Shipeng Li & Tao Mei &
School of Electrical Engineering and Computer Science, University of Ottawa, 800 King Edward, K1N 6N5, Ottawa, ON, Canada
Abdulmotaleb El Saddik
School of Computer and Information, Hefei University of Technology, Road Tunxi 193#, 230009, Hefei, Anhui, China
Meng Wang & Richang Hong &
Department of Information Engineering and Computer Science, University of Trento, ommarive 14, 38100, Trento, Italy
Nicu Sebe
Department of Electrical and Computer Engineering, National University of Singapore, 4 Engineering Drive 3, 117583, Singapore, Singapore
Shuicheng Yan
School of Computing, CLARITY: Centre for Sensor Web Technologies, Dublin City University, Glasnevin, 9, Dublin, Ireland
Cathal Gurrin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Li, H., Sun, F., Guan, Y. (2013). Robust Detection and Localization of Human Action in Video. In: Li, S., et al. Advances in Multimedia Modeling. Lecture Notes in Computer Science, vol 7733. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35728-2_25

Download citation

DOI: https://doi.org/10.1007/978-3-642-35728-2_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-35727-5
Online ISBN: 978-3-642-35728-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics