Abstract
Recognizing human actions from image sequences is an active area of research in computer vision. In this paper, a novel HMM-based approach is proposed for human action recognition using 3D positions of body joints. First, actions are segmented into meaningful action units called dynamic instants and intervals by using motion velocities, the direction of motion, and the curvatures of 3D trajectories. Then action unit with its spatio-temporal feature sets are clustered using unsupervised learning, like SOM, to generate a sequence of discrete symbols. To overcome an abrupt change or an abnormal in its gesticulation between different performances of the same action, Profile Hidden Markov Models (Profile HMMs) are applied with these symbol sequences using Viterbi and Baum-Welch algorithms for human activity recognition. The experimental evaluations show that the proposed approach achieves promising results compared to other state of the art algorithms.
Chapter PDF
Similar content being viewed by others
Keywords
References
Niebles, J.C., Wang, H., Fei-Fei, L.: Unsupervised learning of human action categories using spatial-temporal words. International Journal of Computer Vision 79(3), 299–318 (2008)
Laptev, I.: On space-time interest points. International Journal of Computer Vision. 64(2–3), 107–123 (2005)
Rabiner, L.R., Juang, B.-H.: Fundamentals of speech recognition, vol. 14. PTR Prentice Hall Englewood Cliffs (1993)
Yang, X., Tian, T.: Effective 3d action recognition using eigenjoints. Journal of Visual Communication and Image Representation 25(1), 2–11 (2014)
Krogh, A., Brown, M., Mian, I.S., Sjolander, K., Haussler, D.: Hidden Markov models in computational biology: Applications to protein modeling. Journal of Molecular Biology 235(5), 1501–1531 (1994)
Ding, W., Liu, K., Cheng, F., et al.: STFC: Spatio-temporal feature chain for skeleton-based human action recognition. Journal of Visual Communication and Image Representation 26, 329–337 (2015)
Kohonen, T.: The self-organizing map. Proceedings of the IEEE 78(9), 1464–1480 (1990)
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Transactions on Pattern Analysis and Machine Intelligence PAMI 1(2), 224–227 (1979)
Poppe, R.: A survey on vision-based human action recognition. Image and Vision Computing 28(6), 976–990 (2010)
Weinland, D., Ronfard, R., Boyer, E.: A survey of vision-based methods for action representation, segmentation and recognition. Computer Vision and Image Understanding 115(2), 224–241 (2011)
Zhang, Z.: Microsoft kinect sensor and its effect. IEEE MultiMedia 19(2), 4–10 (2012)
Li, W., Zhang, Z., Liu, Z.: Action recognition based on a bag of 3d points. In: 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 9–14.IEEE (2010)
Xia, L., Chen, C.-C., Aggarwal, J.: View invariant human action recognition using histograms of 3d joints. In: 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp. 20–27. IEEE (2012)
Lv, F., Nevatia, R.: Single view human action recognition using key pose matching and viterbi path searching. In: 2007 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8. IEEE (2007)
Zhou, F., Torre, F.: Canonical time warping for alignment of human behavior. In: Advances in Neural Information Processing Systems, pp. 2286–2294 (2009)
Ferguson, J.D.: Variable duration models for speech. In: Proceedings of the Symposium on the Application of HMMs to Text and Speech, pp. 143–179 (1980)
Viterbi, A.J.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory 13(2), 260–269 (1967)
Baum, L.E., Petrie, T., Soules, G., Weiss, N.: A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. The Annals of Mathematical Statistics 164–171 (1970)
Ellis, C., Masood, S.Z., Tappen, M.F., Laviola Jr, J.J., Sukthankar, R.: Exploring the trade-off between accuracy and observational latency in action recognition. International Journal of Computer Vision 101(3), 420–436 (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ding, W., Liu, K., Cheng, F., Shi, H., Zhang, B. (2015). Skeleton-Based Human Action Recognition with Profile Hidden Markov Models. In: Zha, H., Chen, X., Wang, L., Miao, Q. (eds) Computer Vision. CCCV 2015. Communications in Computer and Information Science, vol 546. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-48558-3_2
Download citation
DOI: https://doi.org/10.1007/978-3-662-48558-3_2
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-48557-6
Online ISBN: 978-3-662-48558-3
eBook Packages: Computer ScienceComputer Science (R0)