Skeleton-Based Human Action Recognition with Profile Hidden Markov Models

Ding, Wenwen; Liu, Kai; Cheng, Fei; Shi, Huan; Zhang, Baijian

doi:10.1007/978-3-662-48558-3_2

Wenwen Ding^14,15,
Kai Liu¹⁴,
Fei Cheng¹⁴,
Huan Shi¹⁴ &
…
Baijian Zhang¹⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 546))

Included in the following conference series:

CCF Chinese Conference on Computer Vision

3169 Accesses
4 Citations

Abstract

Recognizing human actions from image sequences is an active area of research in computer vision. In this paper, a novel HMM-based approach is proposed for human action recognition using 3D positions of body joints. First, actions are segmented into meaningful action units called dynamic instants and intervals by using motion velocities, the direction of motion, and the curvatures of 3D trajectories. Then action unit with its spatio-temporal feature sets are clustered using unsupervised learning, like SOM, to generate a sequence of discrete symbols. To overcome an abrupt change or an abnormal in its gesticulation between different performances of the same action, Profile Hidden Markov Models (Profile HMMs) are applied with these symbol sequences using Viterbi and Baum-Welch algorithms for human activity recognition. The experimental evaluations show that the proposed approach achieves promising results compared to other state of the art algorithms.

Download to read the full chapter text

Chapter PDF

Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data

Computer vision-based approach for skeleton-based action recognition, SAHC

Article 11 November 2023

Trajectory Based Integrated Features for Action Classification from Depth Data

Keywords

References

Niebles, J.C., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision 79(3), 299–318 (2008)
Article Google Scholar
Laptev, I.: On space-time interest points. International Journal of Computer Vision. 64(2–3), 107–123 (2005)
Article Google Scholar
Rabiner, L.R., Juang, B.-H.: Fundamentals of speech recognition, vol. 14. PTR Prentice Hall Englewood Cliffs (1993)
Google Scholar
Yang, X., Tian, T.: Effective 3d action recognition using eigenjoints. Journal of Visual Communication and Image Representation 25(1), 2–11 (2014)
Article MathSciNet Google Scholar
Krogh, A., Brown, M., Mian, I.S., Sjolander, K., Haussler, D.: Hidden Markov models in computational biology: Applications to protein modeling. Journal of Molecular Biology 235(5), 1501–1531 (1994)
Article Google Scholar
Ding, W., Liu, K., Cheng, F., et al.: STFC: Spatio-temporal feature chain for skeleton-based human action recognition. Journal of Visual Communication and Image Representation 26, 329–337 (2015)
Article Google Scholar
Kohonen, T.: The self-organizing map. Proceedings of the IEEE 78(9), 1464–1480 (1990)
Article Google Scholar
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI 1(2), 224–227 (1979)
Article Google Scholar
Poppe, R.: A survey on vision-based human action recognition. Image and Vision Computing 28(6), 976–990 (2010)
Article Google Scholar
Weinland, D., Ronfard, R., Boyer, E.: A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding 115(2), 224–241 (2011)
Article Google Scholar
Zhang, Z.: Microsoft kinect sensor and its effect. IEEE MultiMedia 19(2), 4–10 (2012)
Article Google Scholar
Li, W., Zhang, Z., Liu, Z.: Action recognition based on a bag of 3d points. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 9–14.IEEE (2010)
Google Scholar
Xia, L., Chen, C.-C., Aggarwal, J.: View invariant human action recognition using histograms of 3d joints. In: 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 20–27. IEEE (2012)
Google Scholar
Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and viterbi path searching. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8. IEEE (2007)
Google Scholar
Zhou, F., Torre, F.: Canonical time warping for alignment of human behavior. In: Advances in Neural Information Processing Systems, pp. 2286–2294 (2009)
Google Scholar
Ferguson, J.D.: Variable duration models for speech. In: Proceedings of the Symposium on the Application of HMMs to Text and Speech, pp. 143–179 (1980)
Google Scholar
Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13(2), 260–269 (1967)
Article MATH Google Scholar
Baum, L.E., Petrie, T., Soules, G., Weiss, N.: A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. The Annals of Mathematical Statistics 164–171 (1970)
Google Scholar
Ellis, C., Masood, S.Z., Tappen, M.F., Laviola Jr, J.J., Sukthankar, R.: Exploring the trade-off between accuracy and observational latency in action recognition. International Journal of Computer Vision 101(3), 420–436 (2013)
Article Google Scholar

Download references

Author information

Authors and Affiliations

School of Computer Science and Technology, Xidian University, Xi’an, China
Wenwen Ding, Kai Liu, Fei Cheng, Huan Shi & Baijian Zhang
School of Mathematical Sciences, Huaibei Normal University, Anhui, China
Wenwen Ding

Authors

Wenwen Ding
View author publications
You can also search for this author in PubMed Google Scholar
Kai Liu
View author publications
You can also search for this author in PubMed Google Scholar
Fei Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Huan Shi
View author publications
You can also search for this author in PubMed Google Scholar
Baijian Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kai Liu .

Editor information

Editors and Affiliations

Peking University, Beijing, China
Honbin Zha
Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China
Xilin Chen
Chinese Academy of Sciences, Beijing, China
Liang Wang
Xidian University, Shaanxi, China
Qiguang Miao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ding, W., Liu, K., Cheng, F., Shi, H., Zhang, B. (2015). Skeleton-Based Human Action Recognition with Profile Hidden Markov Models. In: Zha, H., Chen, X., Wang, L., Miao, Q. (eds) Computer Vision. CCCV 2015. Communications in Computer and Information Science, vol 546. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48558-3_2

Download citation

DOI: https://doi.org/10.1007/978-3-662-48558-3_2
Published: 06 November 2015
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48557-6
Online ISBN: 978-3-662-48558-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

the China Computer Federation (CCF) (opens in a new tab)

Skeleton-Based Human Action Recognition with Profile Hidden Markov Models

Abstract

Chapter PDF

Similar content being viewed by others

Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data

Computer vision-based approach for skeleton-based action recognition, SAHC

Trajectory Based Integrated Features for Action Classification from Depth Data

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Societies and partnerships

Navigation

Skeleton-Based Human Action Recognition with Profile Hidden Markov Models

Abstract

Chapter PDF

Similar content being viewed by others

Real-Time Skeleton-Tracking-Based Human Action Recognition Using Kinect Data

Computer vision-based approach for skeleton-based action recognition, SAHC

Trajectory Based Integrated Features for Action Classification from Depth Data

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Societies and partnerships

Search

Navigation