Abstract
This paper presents a novel approach to tracking articulated human motion with monocular video. In a conventional tracking system based on particle filters, it is very challenging to track a complex human pose with many degrees of freedom. A typical solution to this problem is to track the pose in a low dimensional latent space by manifold learning techniques, e.g., the Gaussian process dynamical model (GPDM model). In this paper, we extend the GPDM model into a graph structure (called GPDM graph) to better express the diverse dynamics of human motion, where multiple latent spaces are constructed and dynamically connected to each other appropriately by an unsupervised learning method. Basically, the proposed model has both intra-transitions (in each latent space) and inter-transitions (among latent spaces). Moreover, the probability of inter-transition is dynamic, depending on the current latent state. Using the proposed GPDM graph model, we can track human motion with monocular video, where the average tracking errors are improved from the state-of-the-art methods in our experiments.
Chapter PDF
References
Agarwal, A., Triggs, B.: Recovering 3d human pose from monocular images. IEEE Trans. on PAMI 28(1), 44–58 (2006)
Arikan, O.: Compression of motion capture databases. ACM Trans. on Graphics 25(3), 890–897 (2006)
Arikan, O., Forsyth, D.A.: Interactive motion generation from examples. ACM Trans. on Graphics 21(3), 483–490 (2002)
Arulampalam, M., Maskell, S., Gordon, N., Clapp, T.: A tutorial on particle filters for online nonlinear/non-gaussian bayesian tracking. IEEE Trans. on Signal Processing 50(2), 174–188 (2002)
Balan, A., Black, M.J.: An adaptive appearancemodel approach formodel-based articulated object tracking. In: IEEE CVPR, vol. 1, pp. 758–765 (2006)
Moeslund, T.B., Hilton, A., Kruger, V.: A survey of advances in vision based human motion capture and analysis. CVIU 104(2), 90–126 (2006)
Chen, J., Kim, M., Wang, Y., Ji, Q.: Switching gaussian process dynamic models for simultaneous composite motion tracking and recognition. In: IEEE CVPR, pp. 2655–2662 (2009)
Kovar, L., Gleicherl, M., Pighin, F.: Motion graphs. ACM Trans. on Graphics 21(3), 473–482 (2002)
Lawrence, N.: Gaussian process latent variable models for visualization. In: Proc. Adv. Neural Inf. Process. pp. 329–336 (2003)
Lee, J., Chai, J., Reitsma, P.S.A., Hodgins, J.K., Pollard, N.S.: Interactive control of avatars animated with human motion data. ACM Trans. on Graphics 21(3), 491–500 (2002)
Poppe, R.: Vision-based human motion analysis: an overview. CVIU 108(1/2), 4–18 (2007)
Quirion, S., Duchesne, C., Laurendeau, D., Marchand, M.: Comparing gplvm approaches for dimensionality reduction in character animation. Journal of WSCG 16(1-3), 41–48 (2008)
Sigal, L., Balan, A., Black, M.J.: Humaneva: Synchronized video and motion capture dataset and baseline algorithm for evaluation of articulated human motion. International Journal of Computer Vision 87(1), 4–27 (2010)
Taylor, G.W., Sigal, L., Fleet, D.J., Hinton, G.E.: Dynamical binary latent variable models for 3d human pose tracking. In: IEEE CVPR, pp. 631–638 (2010)
Urtasun, R., Darrell, T.: Sparse probabilistic regression for activity independent human pose inference. In: IEEE CVPR, pp. 1–8 (2008)
Urtasun, R., Fleet, D., Fua, P.: 3d people tracking with gaussian process dynamical models. In: IEEE CVPR, pp. 238–245 (2006)
Wang, J.M., Fleet, D.J., Hertzmann, A.: Gaussian process dynamical models for human motion. IEEE Trans. on PAMI 30(2), 283–298 (2008)
Wang, J., Bodenheimer, B.: Synthesis and evaluation of linear motion transitions. ACM Trans. on Graphics 27(1), 1:1–1:15 (2008)
Xu, J., Takagi, K., Yoneyama, A.: Beat induction from motion capture data using short-term principal component analysis. The Journal of The Institute of Image Information and Television Engineers 64(4), 577–583 (2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Xu, J., Takagi, K., Sakazawa, S. (2011). Human Motion Tracking with Monocular Video by Introducing a Graph Structure into Gaussian Process Dynamical Models. In: Ho, YS. (eds) Advances in Image and Video Technology. PSIVT 2011. Lecture Notes in Computer Science, vol 7087. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25367-6_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-25367-6_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25366-9
Online ISBN: 978-3-642-25367-6
eBook Packages: Computer ScienceComputer Science (R0)