Human Action Recognition: A Survey

Fu, Meixia; Chen, Na; Huang, Zhongjie; Ni, Kaili; Liu, Yuhao; Sun, Songlin; Ma, Xiaomei

doi:10.1007/978-981-13-7123-3_9

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 550)

Included in the following conference series:

International Conference On Signal And Information Processing, Networking And Computers

Abstract

In this paper, we provide a comprehensive survey of human action recognition and prediction, a long-standing and critical area of computer vision. Human action recognition is the first step for a machine to understand and perceive nature, and it is a small part of machine perception. Human action prediction sits a layer above human action recognition; it is a small part of machine cognition and would give machines the ability to imagine and reason. Here, we discuss only human action recognition, covering two methodologies separately: representation-based methods and deep learning-based methods. We then describe four public datasets for human action recognition in detail. We also raise some challenges concerning datasets, given their significance to the development of computer vision. In addition, we compare and summarize recently published research achievements based on deep learning. Finally, we draw conclusions about the methods discussed and outline future challenges for computer vision.

Acknowledgment

This work is supported by the National Natural Science Foundation of China (Project 61471066) and the open project fund (No. 201600017) of the National Key Laboratory of Electromagnetic Environment, China.

Author information

Authors and Affiliations

National Engineering Laboratory for Mobile Network Security, Beijing University of Posts and Telecommunications, Beijing, China
Meixia Fu, Na Chen, Zhongjie Huang, Kaili Ni, Yuhao Liu & Songlin Sun
Key Laboratory of Trustworthy Distributed Computing and Service (BUPT), Ministry of Education, Beijing University of Posts and Telecommunications, Beijing, China
Meixia Fu, Na Chen, Zhongjie Huang, Kaili Ni, Yuhao Liu & Songlin Sun
School of Information and Communication Engineering, Beijing University of Posts and Telecommunications, Beijing, China
Meixia Fu, Na Chen, Zhongjie Huang, Kaili Ni, Yuhao Liu & Songlin Sun
China United Network Communications Group Co., Ltd., Beijing, China
Xiaomei Ma

Corresponding author

Correspondence to Meixia Fu.

Editor information

Editors and Affiliations

Beijing University of Posts and Telecommunications, Beijing, China
Songlin Sun
Beijing University of Posts and Telecommunications, Beijing, China
Meixia Fu
China Unicom, Beijing, China
Lexi Xu

About this paper

Cite this paper

Fu, M. et al. (2019). Human Action Recognition: A Survey. In: Sun, S., Fu, M., Xu, L. (eds) Signal and Information Processing, Networking and Computers. ICSINC 2018. Lecture Notes in Electrical Engineering, vol 550. Springer, Singapore. https://doi.org/10.1007/978-981-13-7123-3_9

DOI: https://doi.org/10.1007/978-981-13-7123-3_9
Published: 16 April 2019
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-7122-6
Online ISBN: 978-981-13-7123-3
eBook Packages: Engineering, Engineering (R0)
