A Deep Learning Approach for Human Action Recognition Using Skeletal Information

Mathe, Eirini; Maniatis, Apostolos; Spyrou, Evaggelos; Mylonas, Phivos

doi:10.1007/978-3-030-32622-7_9

Eirini Mathe^6,7,
Apostolos Maniatis⁸,
Evaggelos Spyrou^6,8 &
…
Phivos Mylonas⁷

Part of the book series: Advances in Experimental Medicine and Biology ((AEMB,volume 1194))

1489 Accesses
6 Citations

Abstract

In this paper we present an approach toward human action detection for activities of daily living (ADLs) that uses a convolutional neural network (CNN). The network is trained on discrete Fourier transform (DFT) images that result from raw sensor readings, i.e., each human action is ultimately described by an image. More specifically, we work using 3D skeletal positions of human joints, which originate from processing of raw RGB sequences enhanced by depth information. The motion of each joint may be described by a combination of three 1D signals, representing its coefficients into a 3D Euclidean space. All such signals from a set of human joints are concatenated to form an image, which is then transformed by DFT and is used for training and evaluation of a CNN. We evaluate our approach using a publicly available challenging dataset of human actions that may involve one or more body parts simultaneously and for two sets of actions which resemble common ADLs.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Hardcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Abadi M et al (2016) TensorFlow: a system for large-scale machine learning. In: Proceedings of the USENIX symposium on operating systems design and implementation (OSDI)
Google Scholar
Berretti S, Daoudi M, Turaga P, Basu A (2018) Representation, analysis, and recognition of 3D humans: a survey. ACM Trans Multim Comput Commun Appl (TOMM) 14(1s):16
Google Scholar
Chollet F (2015) Keras. https://github.com/fchollet/keras
Du Y, Fu Y, Wang L (2015) Skeleton based action recognition with convolutional neural network. In: Proceedings of 3rd IAPR Asian conference on pattern recognition (ACPR). IEEE
Google Scholar
Graves A, Mohamed AR, Hinton G (2013) Speech recognition with deep recurrent neural networks. In: Proceedings of IEEE international conference of acoustics, speech and signal processing (ICASSP)
Google Scholar
Hou Y, Li Z, Wang P, Li W (2018) Skeleton optical spectra-based action recognition using convolutional neural networks. IEEE Trans Circuits Syst Video Technol 28(3):807–811
Article Google Scholar
Jiang W, Yin Z (2015) Human activity recognition using wearable sensors by deep convolutional neural networks. In: Proceedings of ACM international conference on multimedia (MM)
Google Scholar
Ke Q, An S, Bennamoun M, Sohel F, Boussaid F (2017) Skeletonnet: mining deep part features for 3-d action recognition. IEEE Signal Process Lett 24(6):731–735
Article Google Scholar
Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105
Google Scholar
Lawton MP, Brody EM (1969) Assessment of older people: self-maintaining and instrumental activities of daily living. Gerontologist 9(3 Part 1):179–186
Article CAS Google Scholar
LeCun Y, Bottou L, Bengio Y, Haner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86(11):2278–2324
Article Google Scholar
Li C, Hou Y, Wang P, Li W (2017) Joint distance maps based action recognition with convolutional neural networks. IEEE Signal Process Lett 24(5):624–628
Article Google Scholar
Liu C, Hu Y, Li Y, Song S, Liu J (2017a) PKU-MMD: a large scale benchmark for continuous multi-modal human action understanding. In: Proceedings of ACM multimedia workshop (MM)
Google Scholar
Liu M, Liu H, Chen C (2017b) Enhanced skeleton visualization for view invariant human action recognition. Pattern Recogn 68:346–362
Article Google Scholar
Mathe E, Mitsou A, Spyrou E, Mylonas P (2018) Arm gesture recognition using a convolutional neural network. In: Proceedings of international workshop on semantic and social media adaptation and personalization (SMAP)
Google Scholar
Srivastava N, Hinton G, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overfitting. J Mach Learn Res 15(1):1929–1958
Google Scholar
Wang P, Li W, Ogunbona P, Wan J, Escalera S (2018a) RGB-D-based human motion recognition with deep learning: a survey. Comput Vis Image Underst 171:118–139
Article CAS Google Scholar
Wang P, Li W, Li C, Hou Y (2018b) Action recognition based on joint trajectory maps with convolutional neural networks. Knowl-Based Syst 158:43–53
Article Google Scholar
Zhang Z (2012) Microsoft Kinect sensor and its effect. IEEE Multim 19(2):4–10
Article Google Scholar

Download references

Acknowledgments

We acknowledge support for this work by the project SYNTELESIS “Innovative Technologies and Applications based on the Internet of Things (IoT) and the Cloud Computing” (MIS 5002521) which is implemented under the “Action for the Strategic Development on the Research and Technological Sector,” funded by the “Operational Programme Competitiveness, Entrepreneurship and Innovation” (NSRF 2014–2020) and co-financed by Greece and the European Union (European Regional Development Fund).

Author information

Authors and Affiliations

Institute of Informatics and Telecommunications, National Center for Scientific Research- “Demokritos”, Athens, Greece
Eirini Mathe & Evaggelos Spyrou
Department of Informatics, Ionian University, Corfu, Greece
Eirini Mathe & Phivos Mylonas
Department of Computer Engineering T.E, Technological Education Institute of Sterea Ellada, Lamia, Greece
Apostolos Maniatis & Evaggelos Spyrou

Authors

Eirini Mathe
View author publications
You can also search for this author in PubMed Google Scholar
Apostolos Maniatis
View author publications
You can also search for this author in PubMed Google Scholar
Evaggelos Spyrou
View author publications
You can also search for this author in PubMed Google Scholar
Phivos Mylonas
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Evaggelos Spyrou or Phivos Mylonas .

Editor information

Editors and Affiliations

Department of Informatics, Ionian University, Corfu, Greece
Panayiotis Vlamos

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mathe, E., Maniatis, A., Spyrou, E., Mylonas, P. (2020). A Deep Learning Approach for Human Action Recognition Using Skeletal Information. In: Vlamos, P. (eds) GeNeDis 2018. Advances in Experimental Medicine and Biology, vol 1194. Springer, Cham. https://doi.org/10.1007/978-3-030-32622-7_9

Download citation

DOI: https://doi.org/10.1007/978-3-030-32622-7_9
Published: 29 May 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-32621-0
Online ISBN: 978-3-030-32622-7
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)

Publish with us

Policies and ethics