Abstract
Our goal is to automatically segment and recognize basic human actions, such as standing, walking, and waving hands, from a sequence of joint positions or pose angles. Such recognition is difficult due to the high dimensionality of the data and the large spatial and temporal variations within the same action. We decompose the high-dimensional 3-D joint space into a set of feature spaces, where each feature corresponds to the motion of a single joint or of a combination of multiple related joints. For each feature, the dynamics of each action class are learned with one HMM. Given a sequence, the observation probability is computed in each HMM, and a weak classifier for that feature is formed based on those probabilities. The weak classifiers with strong discriminative power are then combined by the Multi-Class AdaBoost (AdaBoost.M2) algorithm. A dynamic programming algorithm is applied to segment and recognize actions simultaneously. Results of recognizing 22 actions on a large number of motion capture sequences, as well as on several annotated and automatically tracked sequences, show the effectiveness of the proposed algorithms.
This research was supported, in part, by the Advanced Research and Development Activity of the U.S. Government under contract No. MDA904-03-C1786.
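As a rough illustration of the per-feature weak classifier described in the abstract, the sketch below scores one observation sequence under one HMM per action class and picks the class with the highest likelihood. It is a minimal sketch under stated assumptions, not the authors' implementation: the observations are assumed to be quantized into discrete symbols, the function names (`hmm_log_likelihood`, `weak_classify`) and the toy model parameters are hypothetical, and the AdaBoost.M2 combination and the dynamic programming segmentation are not shown.

```python
import math


def safe_log(p):
    """log(p) that maps zero probability to -inf instead of raising."""
    return math.log(p) if p > 0 else -math.inf


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def hmm_log_likelihood(obs, start, trans, emit):
    """Forward algorithm in log space: log P(obs | HMM).

    Assumes a discrete-output HMM (an illustrative simplification):
    start[i] is the initial probability of state i, trans[i][j] the
    transition probability i -> j, emit[i][k] the probability of
    emitting symbol k in state i.
    """
    n = len(start)
    alpha = [safe_log(start[i]) + safe_log(emit[i][obs[0]]) for i in range(n)]
    for symbol in obs[1:]:
        alpha = [
            log_sum_exp([alpha[i] + safe_log(trans[i][j]) for i in range(n)])
            + safe_log(emit[j][symbol])
            for j in range(n)
        ]
    return log_sum_exp(alpha)


def weak_classify(obs, hmms):
    """Weak classifier for one feature.

    hmms maps an action label to its (start, trans, emit) parameters.
    Returns the label whose HMM gives the highest log-likelihood,
    together with the per-class scores.
    """
    scores = {label: hmm_log_likelihood(obs, *params)
              for label, params in hmms.items()}
    return max(scores, key=scores.get), scores


# Toy two-state models for a single feature (made-up numbers).
# Symbol 0 = "joint nearly still", symbol 1 = "joint moving".
hmms = {
    "stand": ([0.5, 0.5], [[0.9, 0.1], [0.1, 0.9]], [[0.9, 0.1], [0.8, 0.2]]),
    "wave": ([0.5, 0.5], [[0.5, 0.5], [0.5, 0.5]], [[0.1, 0.9], [0.2, 0.8]]),
}

label_still, _ = weak_classify([0, 0, 0, 0], hmms)
label_moving, _ = weak_classify([1, 1, 1, 0], hmms)
print(label_still, label_moving)
```

In a full system along the lines of the abstract, one such classifier would exist per feature, and a boosting stage would weight and combine the more discriminative ones. The per-class scores returned here are the kind of quantity such a stage could consume.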
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lv, F., Nevatia, R. (2006). Recognition and Segmentation of 3-D Human Action Using HMM and Multi-class AdaBoost. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744085_28
DOI: https://doi.org/10.1007/11744085_28
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33838-3
Online ISBN: 978-3-540-33839-0
eBook Packages: Computer Science, Computer Science (R0)