Generative 2D and 3D Human Pose Estimation with Vote Distributions

Brauer, Jürgen; Hübner, Wolfgang; Arens, Michael

doi:10.1007/978-3-642-33179-4_45

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7431)

Included in the following conference series:

International Symposium on Visual Computing

Abstract

We address the problem of 2D and 3D human pose estimation using monocular camera information only. Generative approaches usually consist of two computationally demanding steps. First, different configurations of a complex 3D body model are projected into the image plane. Second, the projected synthetic person images are compared with images of real persons on the basis of features such as silhouettes or edges. To lower the computational cost of generative models, we propose to use vote distributions for anatomical landmarks, generated by one Implicit Shape Model per landmark. These vote distributions represent the image evidence in a more compact form and make it possible to use a simple 3D stick-figure body model, since the projected 3D marker points of the stick figure can be compared directly with vote locations at negligible computational cost. This allows us to evaluate nearly half a million different 3D poses per second on standard hardware and, further, to consider a huge set of 3D pose and configuration hypotheses in each frame. The approach is evaluated on the new Utrecht Multi-Person Motion (UMPM) benchmark, yielding an average joint angle reconstruction error of 8.0°.
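
The core idea of comparing projected stick-figure marker points with vote locations can be illustrated with a minimal Python sketch. The abstract does not specify the camera model or the scoring function, so everything here is an assumption made for illustration: a pinhole camera with hypothetical intrinsics (`focal`, `cx`, `cy`), votes stored as `(u, v, weight)` tuples, a Gaussian distance kernel with a hypothetical width `sigma`, and the function names themselves. It is not the authors' implementation.

```python
import math

def project(point3d, focal=500.0, cx=320.0, cy=240.0):
    """Pinhole projection of a camera-frame 3D point (x, y, z), z > 0, to pixels.
    The intrinsics are hypothetical defaults, not values from the paper."""
    x, y, z = point3d
    return (focal * x / z + cx, focal * y / z + cy)

def landmark_score(pixel, votes, sigma=10.0):
    """Sum of vote weights, each damped by a Gaussian of its distance to `pixel`.
    The Gaussian kernel is an assumed choice of comparison function."""
    u, v = pixel
    total = 0.0
    for vu, vv, weight in votes:
        d2 = (u - vu) ** 2 + (v - vv) ** 2
        total += weight * math.exp(-d2 / (2.0 * sigma ** 2))
    return total

def pose_score(pose, votes_per_landmark):
    """Score one 3D pose hypothesis by summing the scores of its landmarks."""
    return sum(
        landmark_score(project(point), votes_per_landmark[name])
        for name, point in pose.items()
    )

def best_pose(hypotheses, votes_per_landmark):
    """Return the hypothesis whose projected landmarks agree best with the votes."""
    return max(hypotheses, key=lambda pose: pose_score(pose, votes_per_landmark))

# Toy vote distributions for two landmarks (made-up numbers).
votes = {
    "head": [(320.0, 140.0, 1.0), (325.0, 138.0, 0.5)],
    "hand": [(420.0, 240.0, 0.8)],
}

# Two toy pose hypotheses: landmark name -> 3D position in camera coordinates.
pose_a = {"head": (0.0, -0.4, 2.0), "hand": (0.4, 0.0, 2.0)}
pose_b = {"head": (0.4, -0.4, 2.0), "hand": (-0.4, 0.0, 2.0)}

winner = best_pose([pose_b, pose_a], votes)
print("pose_a wins" if winner is pose_a else "pose_b wins")
```

Because each hypothesis costs only a handful of projections and distance evaluations, rather than rendering and comparing a full synthetic person image, a scheme of this kind makes it plausible to test very large numbers of poses per frame.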

Author information

Authors and Affiliations

Fraunhofer Institute of Optronics, System Technologies and Image Exploitation, Gutleuthausstr. 1, 76275, Ettlingen, Germany
Jürgen Brauer, Wolfgang Hübner & Michael Arens

Editor information

Editors and Affiliations

Department of Computer Science, University of Nevada, 89557, Reno, NV, USA
George Bebis
NASA Ames Research Center, 94035, Moffett Field, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, 94720, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, 89512, Reno, NV, USA
Darko Koracin
Department of Computer Science, University of California at Irvine, 92697-3435, Irvine, CA
Charless Fowlkes
Eastman Kodak Company, 14650-2102, Rochester, NY, USA
Sen Wang
Department of Computer Science and Engineering, University of Colorado at Denver, 80217, Denver, CO, USA
Min-Hyung Choi
VRVis Zentrum für Virtual Reality und Visualisierung, 1220, Vienna, Austria
Stephan Mantler
California Institute for Telecommunications and Information Technology, University of California, 92093, San Diego, La Jolla, CA, USA
Jürgen Schulze
KAUST Visualization Core Lab., 23955-6900, Thuwal, Saudi Arabia
Daniel Acevedo
Stony Brook University, 11794-4400, NY, USA
Klaus Mueller
Argonne National Laboratory, 60439, IL, USA
Michael Papka

About this paper

Cite this paper

Brauer, J., Hübner, W., Arens, M. (2012). Generative 2D and 3D Human Pose Estimation with Vote Distributions. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2012. Lecture Notes in Computer Science, vol 7431. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33179-4_45

DOI: https://doi.org/10.1007/978-3-642-33179-4_45
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33178-7
Online ISBN: 978-3-642-33179-4
eBook Packages: Computer Science, Computer Science (R0)
