Fast Multi-aspect 2D Human Detection

  • Tai-Peng Tian
  • Stan Sclaroff
Part of the Lecture Notes in Computer Science book series (LNCS, volume 6313)


We address the problem of detecting human figures in images, accounting for the fact that the figure may be imaged from a range of viewpoints. We capture the geometric deformations of the 2D human figure using an extension of the Common Factor Model (CFM) of Lan and Huttenlocher. The key contribution of the paper is an improved iterative message passing inference algorithm that runs faster than the original CFM algorithm. The speedup rests on the insight that messages created using the distance transform are shift invariant, so a message can be created once and then shifted for subsequent iterations. Since shifting (O(1) complexity) is faster than computing a distance transform (O(n) complexity), a significant speedup is observed in the experiments. We demonstrate the effectiveness of the new model for the human parsing problem using the Iterative Parsing data set. The results are competitive with the state-of-the-art detection algorithm of Andriluka et al.
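The shift-invariance idea can be illustrated with a minimal one-dimensional sketch. It is not the authors' implementation. It assumes a quadratic deformation cost, so that a message is D[p] = min_q cost[q] + (p - q)^2, and it assumes that later iterations change only an integer offset between the two parts. The names distance_transform_1d, brute_force_message and shifted_lookup, and the cost values, are hypothetical. The transform uses the well-known linear-time lower-envelope method and expects finite costs.

```python
import math

def distance_transform_1d(f):
    """Linear-time computation of D[p] = min_q f[q] + (p - q)**2 (finite f)."""
    n = len(f)
    d = [0.0] * n
    v = [0] * n            # grid positions of parabolas in the lower envelope
    z = [0.0] * (n + 1)    # boundaries between consecutive envelope parabolas
    k = 0
    z[0], z[1] = -math.inf, math.inf
    for q in range(1, n):
        while True:
            # Intersection of the parabola rooted at q with the one at v[k].
            s = ((f[q] + q * q) - (f[v[k]] + v[k] * v[k])) / (2 * q - 2 * v[k])
            if s <= z[k]:
                k -= 1     # the parabola at v[k] is hidden, so drop it
            else:
                break
        k += 1
        v[k] = q
        z[k], z[k + 1] = s, math.inf
    k = 0
    for p in range(n):
        while z[k + 1] < p:
            k += 1
        d[p] = (p - v[k]) ** 2 + f[v[k]]
    return d

def brute_force_message(f, offset):
    """Reference message m[p] = min_q f[q] + (p - offset - q)**2, in O(n^2)."""
    n = len(f)
    return [min(f[q] + (p - offset - q) ** 2 for q in range(n)) for p in range(n)]

def shifted_lookup(base, offset, p):
    """Read the offset message at p from the base message via an index shift."""
    q = p - offset
    # Cells shifted in from outside the grid are unknown; treat them as infinite.
    return base[q] if 0 <= q < len(base) else math.inf

cost = [4.0, 1.0, 3.0, 0.5, 2.0, 5.0, 0.0, 2.5]   # made-up part costs
base = distance_transform_1d(cost)                 # computed once
offset = 2                                         # hypothetical new part offset
reused = [shifted_lookup(base, offset, p) for p in range(len(cost))]
```

Under these assumptions, each entry of reused that falls inside the grid equals the message recomputed from scratch for that offset, while the shift itself only changes an index. Cells near the border, where the shifted index leaves the grid, would need padding or recomputation in a real system.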


Body Part; Detection Result; Inference Algorithm; Compatibility Function; Viewpoint Change
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. Seemann, E., Leibe, B., Schiele, B.: Multi-aspect detection of articulated objects. In: Proc. CVPR (2006)
  2. Lan, X., Huttenlocher, D.P.: Beyond trees: Common-factor models for 2D human pose recovery. In: Proc. ICCV (2005)
  3. Ramanan, D.: Learning to parse images of articulated objects. In: Proc. NIPS (2006)
  4. Andriluka, M., Roth, S., Schiele, B.: Pictorial structures revisited: People detection and articulated pose estimation. In: Proc. CVPR (2009)
  5. Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. IJCV 61, 55–79 (2005)
  6. Ferrari, V., Marin-Jimenez, M., Zisserman, A.: Progressive search space reduction for human pose estimation. In: Proc. CVPR (2008)
  7. Ren, X., Berg, A.C., Malik, J.: Recovering human body configurations using pairwise constraints between parts. In: Proc. ICCV (2005)
  8. Jiang, H., Martin, D.R.: Global pose estimation using non-tree models. In: Proc. CVPR (2008)
  9. Crandall, D., Felzenszwalb, P., Huttenlocher, D.: Spatial priors for part-based recognition using statistical models. In: Proc. CVPR (2005)
  10. Bergtholdt, M., Kappes, J., Schmidt, S., Schnorr, C.: A study of part-based object class detection using complete graphs. IJCV 28, 416–431 (2009)
  11. Kumar, M.P., Koller, D.: Learning a small mixture of trees. In: Proc. NIPS (2009)
  12. Lan, X., Huttenlocher, D.: A unified spatio-temporal articulated model for tracking. In: Proc. CVPR (2004)
  13. Buehler, P., Everingham, M., Huttenlocher, D.P., Zisserman, A.: Long term arm and hand tracking for continuous sign language TV broadcasts. In: Proc. BMVC (2008)
  14. Tuzel, O., Porikli, F., Meer, P.: Region covariance: A fast descriptor for detection and classification. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3952, pp. 589–600. Springer, Heidelberg (2006)

Copyright information

© Springer-Verlag Berlin Heidelberg 2010

Authors and Affiliations

  • Tai-Peng Tian (1)
  • Stan Sclaroff (1)
  1. Department of Computer Science, Boston University
