Evaluation of Head Pose Estimation for Studio Data

  • Jilin Tu
  • Yun Fu
  • Yuxiao Hu
  • Thomas Huang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4122)


This paper introduces our head pose estimation system that localizes nose-tip of the faces and estimate head poses in studio quality pictures. After the nose-tip in the training data are manually labeled, the appearance variation caused by head pose changes is characterized by tensor model. Given images with unknown head pose and nose-tip location, the nose-tip of the face is localized in a coarse-to-fine fashion, and the head pose is estimated simultaneously by the head pose tensor model. The image patches at the localized nose tips are then cropped and sent to two other head pose estimators based on LEA and PCA techniques. We evaluated our system on the Pointing’04 head pose image database. With the nose-tip location known, our head pose estimators can achieve 94~96% head pose classification accuracy(within ±15 o ). With nose-tip unknown, we achieves 85% nose-tip localization accuracy(within 3 pixels from the ground truth), and 81~84% head pose classification accuracy(within ±15 o ).


Tilt Angle Image Patch Tensor Model Laplacian Pyramid Core Tensor 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Wu, J.W., Pedersen, J.M., Putthividhya, D., Norgaard, D., Trivedi, M.M.: A Two-level Pose Estimation Framework Using Majority Voting of Gabor Wavelets and Bunch Graph Analysis. In: Pointing 2004 (2004)Google Scholar
  2. 2.
    Rainer Stiefelhagen: Estimating Head Pose with Neural Networks-Results on the Pointing04 ICPR Workshop Evaluation Data. In: Pointing 2004 (2004)Google Scholar
  3. 3.
    Gourier, N., Hall, D., Crowley, J.L.: Estimating Face orientation from Robust Detection of Salient Facial Structures. In: Pointing 2004 (2004)Google Scholar
  4. 4.
    Burt, P.J., Adelson, E.H.: The Laplacian pyramid as a compact image code. IEEE Trans. Commun. 31(4), 532–540 (1983)CrossRefGoogle Scholar
  5. 5.
    Vasilescu, M.A.O., Terzopoulos, D.: Multilinear Subspace Analysis for Image Ensembles. In: Proc. Computer Vision and Pattern Recognition Conf., vol. 2, pp. 93–99 (2003)Google Scholar
  6. 6.
    Vasilescu, M.A.O., Terzopoulos, D.: Multilinear Independent Components Analysis. In: Proc. Computer Vision and Pattern Recognition Conf. (2005)Google Scholar
  7. 7.
    Jones, M.J., Rehg, J.M.: Statistical Color Models with Application to Skin Detection. Int. J. of Computer Vision 46(1), 81–96 (2002)zbMATHCrossRefGoogle Scholar
  8. 8.
    Fu, Y., Huang, T.S.: Graph Embedded Analysis for Head Pose Estimation. In: 7th IEEE International Conference Automatic Face and Gesture Recognition (2006)Google Scholar
  9. 9.
    Turk, M., Pentland, A.: Eigenfaces for recognition. Journal of Cognitive Neuroscience 3(1), 71–86 (1991)CrossRefGoogle Scholar
  10. 10.
    Ostu, N.: A thresholding selection method from gray-level histograms, IEEE Trans. IEEE Trans. Systems, Man & Cybernetics 9(1), 62–66 (1979)CrossRefGoogle Scholar

Copyright information

© Springer Berlin Heidelberg 2007

Authors and Affiliations

  • Jilin Tu
    • 1
  • Yun Fu
    • 1
  • Yuxiao Hu
    • 1
  • Thomas Huang
    • 1
  1. 1.Beckman Institute, University of Illinois at Urbana-Champaign, 405 N Mathews Ave, Urbana, IL 61801USA

Personalised recommendations