Skip to main content

Human Pose Estimation in Stereo Images

  • Conference paper
Book cover Articulated Motion and Deformable Objects (AMDO 2014)

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8563))

Included in the following conference series:

Abstract

In this paper, we address the problem of 3D human body pose estimation from depth images acquired by a stereo camera. Compared to the Kinect sensor, stereo cameras work outdoors having a much higher operational range, but produce noisier data. In order to deal with such data, we propose a framework for 3D human pose estimation that relies on random forests. The first contribution is a novel grid-based shape descriptor robust to noisy stereo data that can be used by any classifier. The second contribution is a two step classification procedure, first classifying the body orientation, then proceeding with determining the full 3D pose within this orientation cluster. To validate our method, we introduce a dataset recorded with a stereo camera synchronized with an optical motion capture system that provides ground truth human body poses.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Shotton, J., Fitzgibbon, A., Cook, M., Finocchio, T.S.M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR (2011)

    Google Scholar 

  2. Girshick, R., Shotton, J., Kohli, P., Criminisi, A., Fitzgibbon, A.: Efficient regression of general-activity human poses from depth images. In: ICCV (2011)

    Google Scholar 

  3. Taylor, J., Shotton, J., Sharp, T., Fitzgibbon, A.: The vitruvian manifold: Inferring dense correspondences for one-shot human pose estimation. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 103–110. IEEE (2012)

    Google Scholar 

  4. Pons-Moll, G., Taylor, J., Shotton, J., Hertzmann, A., Fitzgibbon, A.: Metric regression forests for human pose estimation. In: BMVC 2013 (2013)

    Google Scholar 

  5. Flohr, F., Gavrila, D.M.: Pedcut: An iterative framework for pedestrian segmentation combining shape models and multiple data cues. In: BMVC 2013 (2013)

    Google Scholar 

  6. Sun, M., Kohli, P., Shotton, J.: Conditional regression forests for human pose estimation. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 3394–3401 (2012)

    Google Scholar 

  7. Plänkers, R., Fua, P.: Articulated soft objects for multi-view shape and motion capture. IEEE Trans. Pattern Anal. Mach. Intell. 25(10) (2003)

    Google Scholar 

  8. Urtasun, R., Fua, P.: 3d human body tracking using deterministic temporal motion models. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 92–106. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  9. Bernier, O., Cheung-Mon-Chan, P., Bouguet, A.: Fast nonparametric belief propagation for real-time stereo articulated body tracking. Computer Vision and Image Understanding 113(1), 29–47 (2009)

    Article  Google Scholar 

  10. Keskin, C., Kıraç, F., Kara, Y.E., Akarun, L.: Hand pose estimation and hand shape classification using multi-layered randomized decision forests. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 852–863. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  11. Enzweiler, M., Gavrila, D.M.: Integrated pedestrian classification and orientation estimation. In: IEEE Computer Vision and Pattern Recognition, CVPR (2010)

    Google Scholar 

  12. Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5) (2002)

    Google Scholar 

  13. Kontschieder, P., Kohli, P., Shotton, J., Criminisi, A.: Geof: Geodesic forests for learning coupled predictors. In: CVPR 2013 (2013)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Lallemand, J., Szczot, M., Ilic, S. (2014). Human Pose Estimation in Stereo Images. In: Perales, F.J., Santos-Victor, J. (eds) Articulated Motion and Deformable Objects. AMDO 2014. Lecture Notes in Computer Science, vol 8563. Springer, Cham. https://doi.org/10.1007/978-3-319-08849-5_2

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-08849-5_2

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-08848-8

  • Online ISBN: 978-3-319-08849-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics