Pose Machines: Articulated Pose Estimation via Inference Machines

Ramakrishna, Varun; Munoz, Daniel; Hebert, Martial; Bagnell, James Andrew; Sheikh, Yaser

doi:10.1007/978-3-319-10605-2_3

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8690)

Included in the following conference series:

European Conference on Computer Vision

Abstract

State-of-the-art approaches for articulated human pose estimation are rooted in parts-based graphical models. These models are often restricted to tree-structured representations and simple parametric potentials in order to enable tractable inference. However, these simple dependencies fail to capture all the interactions between body parts. While models with more complex interactions can be defined, learning the parameters of these models remains challenging with intractable or approximate inference. In this paper, instead of performing inference on a learned graphical model, we build upon the inference machine framework and present a method for articulated human pose estimation. Our approach incorporates rich spatial interactions among multiple parts and information across parts of different scales. Additionally, the modular framework of our approach enables both ease of implementation without specialized optimization solvers, and efficient inference. We analyze our approach on two challenging datasets with large pose variation and outperform the state-of-the-art on these benchmarks.
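
The abstract describes the method only at a high level, so the following is a minimal sketch of the general idea of sequential prediction, not the authors' implementation. It assumes a toy one-dimensional "image" in which each location carries one local score per part, and it shows how each stage could combine that local evidence with context computed from the previous stage's beliefs. The names `context_features`, `run_stages`, and `toy_stage` are hypothetical, the stage function is hand-written rather than learned, and the multi-scale part hierarchy mentioned in the abstract is omitted.

```python
from typing import Callable, List, Sequence

Vector = List[float]
# A stage maps (local evidence, context from previous beliefs) to part scores.
Stage = Callable[[Sequence[float], Sequence[float]], Vector]


def context_features(beliefs: List[Vector], loc: int, radius: int) -> Vector:
    """Mean previous-stage belief per part in a window around `loc`."""
    window = beliefs[max(0, loc - radius): loc + radius + 1]
    n_parts = len(beliefs[0])
    return [sum(row[p] for row in window) / len(window) for p in range(n_parts)]


def normalize(scores: Sequence[float]) -> Vector:
    """Rescale non-negative scores so they sum to one."""
    total = sum(scores)
    if total <= 0:
        return [1.0 / len(scores)] * len(scores)
    return [s / total for s in scores]


def run_stages(image_features: List[Vector], stages: Sequence[Stage],
               n_parts: int, radius: int = 1) -> List[Vector]:
    """Apply the stages in order; each one sees the previous stage's beliefs."""
    # Start from uninformative (uniform) beliefs at every location.
    beliefs = [[1.0 / n_parts] * n_parts for _ in image_features]
    for predict in stages:
        # The comprehension reads the old beliefs; they are replaced afterwards.
        beliefs = [
            normalize(predict(feat, context_features(beliefs, i, radius)))
            for i, feat in enumerate(image_features)
        ]
    return beliefs


def toy_stage(local: Sequence[float], context: Sequence[float]) -> Vector:
    """Hypothetical hand-written stage: local evidence reweighted by context."""
    return [l * (0.5 + c) for l, c in zip(local, context)]


# Three locations, two parts; values are made-up local part scores.
features = [[0.9, 0.1], [0.6, 0.4], [0.2, 0.8]]
beliefs = run_stages(features, [toy_stage, toy_stage], n_parts=2)
best_part = [row.index(max(row)) for row in beliefs]
print(best_part)
```

In this sketch no graphical model is defined and no message-passing solver is called: inference is just a fixed number of passes through ordinary prediction functions. In a learned version each stage would be a trained classifier instead of `toy_stage`.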

Author information

Authors and Affiliations

The Robotics Institute, Carnegie Mellon University, USA
Varun Ramakrishna, Daniel Munoz, Martial Hebert, James Andrew Bagnell & Yaser Sheikh

Editor information

Editors and Affiliations

Department of Computer Science, University of Toronto, 6 King’s College Road, M5H 3S5, Toronto, ON, Canada
David Fleet
Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University in Prague, Technicka 2, 166 27, Prague 6, Czech Republic
Tomas Pajdla
Max-Planck-Institut für Informatik, Campus E1 4, 66123, Saarbrücken, Germany
Bernt Schiele
KU Leuven, ESAT - PSI, iMinds, Kasteelpark Arenberg, 10, Bus 2441, 3001, Leuven, Belgium
Tinne Tuytelaars

About this paper

Cite this paper

Ramakrishna, V., Munoz, D., Hebert, M., Bagnell, J.A., Sheikh, Y. (2014). Pose Machines: Articulated Pose Estimation via Inference Machines. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds) Computer Vision – ECCV 2014. Lecture Notes in Computer Science, vol 8690. Springer, Cham. https://doi.org/10.1007/978-3-319-10605-2_3

DOI: https://doi.org/10.1007/978-3-319-10605-2_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10604-5
Online ISBN: 978-3-319-10605-2
eBook Packages: Computer Science, Computer Science (R0)
