Cascaded Models for Articulated Pose Estimation

Sapp, Benjamin; Toshev, Alexander; Taskar, Ben

doi:10.1007/978-3-642-15552-9_30

Benjamin Sapp¹⁹,
Alexander Toshev¹⁹ &
Ben Taskar¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 6312))

Included in the following conference series:

European Conference on Computer Vision

5880 Accesses
75 Citations

Abstract

We address the problem of articulated human pose estimation by learning a coarse-to-fine cascade of pictorial structure models. While the fine-level state-space of poses of individual parts is too large to permit the use of rich appearance models, most possibilities can be ruled out by efficient structured models at a coarser scale. We propose to learn a sequence of structured models at different pose resolutions, where coarse models filter the pose space for the next level via their max-marginals. The cascade is trained to prune as much as possible while preserving true poses for the final level pictorial structure model. The final level uses much more expensive segmentation, contour and shape features in the model for the remaining filtered set of candidates. We evaluate our framework on the challenging Buffy and PASCAL human pose datasets, improving the state-of-the-art.

Download to read the full chapter text

Chapter PDF

Pose Machines: Articulated Pose Estimation via Inference Machines

DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model

Human Pose Estimation by a Series of Residual Auto-Encoders

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Fischler, M., Elschlager, R.: The representation and matching of pictorial structures. IEEE Transactions on Computers 100, 67–92 (1973)
Article Google Scholar
Felzenszwalb, P., Huttenlocher, D.: Pictorial structures for object recognition. IJCV 61, 55–79 (2005)
Article Google Scholar
Fergus, R., Perona, P., Zisserman, A.: A sparse object category model for efficient learning and exhaustive recognition. In: Proc. CVPR (2005)
Google Scholar
Ramanan, D., Sminchisescu, C.: Training deformable models for localization. In: CVPR, pp. 206–213 (2006)
Google Scholar
Ferrari, V., Marin-Jimenez, M., Zisserman, A.: Progressive search space reduction for human pose estimation. In: Proc. CVPR (2008)
Google Scholar
Andriluka, M., Roth, S., Schiele, B.: Pictorial structures revisited: People detection and articulated pose estimation. In: Proc. CVPR (2009)
Google Scholar
Fleuret, G., Geman, D.: Coarse-to-Fine Face Detection. IJCV 41, 85–107 (2001)
Article MATH Google Scholar
Viola, P., Jones, M.: Robust real-time object detection. IJCV 57, 137–154 (2002)
Article Google Scholar
Lan, X., Huttenlocher, D.: Beyond trees: Common-factor models for 2d human pose recovery. In: Proc. ICCV, pp. 470–477 (2005)
Google Scholar
Mori, G., Ren, X., Efros, A., Malik, J.: Recovering human body configurations: Combining segmentation and recognition. In: CVPR (2004)
Google Scholar
Ramanan, D.: Learning to parse images of articulated bodies. In: NIPS (2006)
Google Scholar
Ferrari, V., Marin-Jimenez, M., Zisserman, A.: Pose search: retrieving people using their pose. In: Proc. CVPR (2009)
Google Scholar
Weiss, D., Taskar, B.: Structured prediction cascades. In: Proc. AISTATS (2010)
Google Scholar
Carreras, X., Collins, M., Koo, T.: TAG, dynamic programming, and the perceptron for efficient, feature-rich parsing. In: Proc. CoNLL (2008)
Google Scholar
Petrov, S.: Coarse-to-Fine Natural Language Processing. PhD thesis, University of California at Bekeley (2009)
Google Scholar
Felzenszwalb, P., Girshick, R., McAllester, D.: Cascade Object Detection with Deformable Part Models. In: Proc. CVPR (2010)
Google Scholar
Srinivasan, P., Shi, J.: Bottom-up recognition and parsing of the human body. In: ICCV 2005, pp. 824–831. IEEE Computer Society, Los Alamitos (2007)
Google Scholar
Shalev-Shwartz, S., Singer, Y., Srebro, N.: Pegasos: Primal estimated sub-gradient SOlver for SVM. In: Proc. ICML (2007)
Google Scholar
Felzenszwalb, P., Girshick, R., McAllester, D., Ramanan, D.: Object Detection with Discriminatively Trained Part Based Models. In: PAMI (2008)
Google Scholar
Friedman, J., Hastie, T., Tibshirani, R.: Additive logistic regression: A statistical view of boosting. The Annals of Statistics 28, 337–374 (2000)
Article MATH MathSciNet Google Scholar
Eichner, M., Ferrari, V.: Better appearance models for pictorial structures. In: Proc. BMVC (2009)
Google Scholar
Freund, Y., Schapire, R.: A decision-theoretic generalization of on-line learning and an application to boosting. JCSS 55, 119–139 (1997)
MATH MathSciNet Google Scholar
Ramanan, D., Forsyth, D., Zisserman, A.: Strike a pose: Tracking people by finding stylized poses. In: Proc. CVPR, vol. 1, p. 271 (2005)
Google Scholar
Cour, T., Benezit, F., Shi, J.: Spectral segmentation with multiscale graph decomposition. In: Proc. CVPR (2005)
Google Scholar
Sapp, B., Jordan, C., Taskar, B.: Adaptive pose priors for pictorial structures. In: CVPR (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

University of Pennsylvania, Philadelphia, PA, 19104, USA
Benjamin Sapp, Alexander Toshev & Ben Taskar

Authors

Benjamin Sapp
View author publications
You can also search for this author in PubMed Google Scholar
Alexander Toshev
View author publications
You can also search for this author in PubMed Google Scholar
Ben Taskar
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

GRASP Laboratory, University of Pennsylvania, 3330 Walnut Street, 19104, Philadelphia, PA, USA
Kostas Daniilidis
School of Electrical and Computer Engineering, National Technical University of Athens, 15773, Athens, Greece
Petros Maragos
Department of Applied Mathematics, Ecole Centrale de Paris, Grande Voie des Vignes, 92295, Chatenay-Malabry, France
Nikos Paragios

1 Electronic Supplementary Material

Electronic Supplementary Material (1 KB)

Electronic Supplementary Material (10 KB)

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sapp, B., Toshev, A., Taskar, B. (2010). Cascaded Models for Articulated Pose Estimation. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6312. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15552-9_30

Download citation

DOI: https://doi.org/10.1007/978-3-642-15552-9_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15551-2
Online ISBN: 978-3-642-15552-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Cascaded Models for Articulated Pose Estimation

Abstract

Chapter PDF

Similar content being viewed by others

Pose Machines: Articulated Pose Estimation via Inference Machines

DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model

Human Pose Estimation by a Series of Residual Auto-Encoders

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

1 Electronic Supplementary Material

Electronic Supplementary Material (1 KB)

Electronic Supplementary Material (10 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Cascaded Models for Articulated Pose Estimation

Abstract

Chapter PDF

Similar content being viewed by others

Pose Machines: Articulated Pose Estimation via Inference Machines

DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model

Human Pose Estimation by a Series of Residual Auto-Encoders

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

1 Electronic Supplementary Material

Electronic Supplementary Material (1 KB)

Electronic Supplementary Material (10 KB)

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation