Abstract
In this paper, we address the problem of 3D human body pose estimation from depth images acquired by a stereo camera. Compared to the Kinect sensor, stereo cameras work outdoors having a much higher operational range, but produce noisier data. In order to deal with such data, we propose a framework for 3D human pose estimation that relies on random forests. The first contribution is a novel grid-based shape descriptor robust to noisy stereo data that can be used by any classifier. The second contribution is a two step classification procedure, first classifying the body orientation, then proceeding with determining the full 3D pose within this orientation cluster. To validate our method, we introduce a dataset recorded with a stereo camera synchronized with an optical motion capture system that provides ground truth human body poses.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Shotton, J., Fitzgibbon, A., Cook, M., Finocchio, T.S.M., Moore, R., Kipman, A., Blake, A.: Real-time human pose recognition in parts from single depth images. In: CVPR (2011)
Girshick, R., Shotton, J., Kohli, P., Criminisi, A., Fitzgibbon, A.: Efficient regression of general-activity human poses from depth images. In: ICCV (2011)
Taylor, J., Shotton, J., Sharp, T., Fitzgibbon, A.: The vitruvian manifold: Inferring dense correspondences for one-shot human pose estimation. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 103–110. IEEE (2012)
Pons-Moll, G., Taylor, J., Shotton, J., Hertzmann, A., Fitzgibbon, A.: Metric regression forests for human pose estimation. In: BMVC 2013 (2013)
Flohr, F., Gavrila, D.M.: Pedcut: An iterative framework for pedestrian segmentation combining shape models and multiple data cues. In: BMVC 2013 (2013)
Sun, M., Kohli, P., Shotton, J.: Conditional regression forests for human pose estimation. In: IEEE Computer Vision and Pattern Recognition (CVPR), pp. 3394–3401 (2012)
Plänkers, R., Fua, P.: Articulated soft objects for multi-view shape and motion capture. IEEE Trans. Pattern Anal. Mach. Intell. 25(10) (2003)
Urtasun, R., Fua, P.: 3d human body tracking using deterministic temporal motion models. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3023, pp. 92–106. Springer, Heidelberg (2004)
Bernier, O., Cheung-Mon-Chan, P., Bouguet, A.: Fast nonparametric belief propagation for real-time stereo articulated body tracking. Computer Vision and Image Understanding 113(1), 29–47 (2009)
Keskin, C., Kıraç, F., Kara, Y.E., Akarun, L.: Hand pose estimation and hand shape classification using multi-layered randomized decision forests. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 852–863. Springer, Heidelberg (2012)
Enzweiler, M., Gavrila, D.M.: Integrated pedestrian classification and orientation estimation. In: IEEE Computer Vision and Pattern Recognition, CVPR (2010)
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5) (2002)
Kontschieder, P., Kohli, P., Shotton, J., Criminisi, A.: Geof: Geodesic forests for learning coupled predictors. In: CVPR 2013 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Lallemand, J., Szczot, M., Ilic, S. (2014). Human Pose Estimation in Stereo Images. In: Perales, F.J., Santos-Victor, J. (eds) Articulated Motion and Deformable Objects. AMDO 2014. Lecture Notes in Computer Science, vol 8563. Springer, Cham. https://doi.org/10.1007/978-3-319-08849-5_2
Download citation
DOI: https://doi.org/10.1007/978-3-319-08849-5_2
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08848-8
Online ISBN: 978-3-319-08849-5
eBook Packages: Computer ScienceComputer Science (R0)