Exploiting Depth and Intensity Information for Head Pose Estimation with Random Forests and Tensor Models

Kaymak, Sertan; Patras, Ioannis

doi:10.1007/978-3-642-37484-5_14

Exploiting Depth and Intensity Information for Head Pose Estimation with Random Forests and Tensor Models

Sertan Kaymak¹⁸ &
Ioannis Patras¹⁸

Conference paper

2818 Accesses
5 Citations

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7729))

Abstract

Real-time accurate head pose estimation is required for several applications. Methods based on 2D images might not provide accurate and robust head pose measurements due to large head pose variations and illumination changes. Robust and accurate head pose estimation can be achieved by integrating intensity and depth information. In this paper we introduce a head pose estimation system that employs random forests and tensor regression algorithms. The former allow the modeling of large head pose variations using large sets of training data, while the latter allow the estimation of more accurate head pose parameters. The combination of the above mentioned methods results in more robust and accurate predictions for large head pose variations. We also study the fusion of different sources of information (intensity and depth images) to determine how their combination affects the performance of a head pose estimation system. The efficiency of the proposed framework is tested on the Biwi Kinect Head Pose dataset, where it is shown that the proposed methodology outperforms typical random forests.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Fanelli, G., Gall, J., Van Gool, L.: Real Time Head Pose Estimation with Random Regression Forests. In: Computer Vision and Pattern Recognition, CVPR, pp. 617–624 (2011)
Google Scholar
Fanelli, G., Weise, T., Gall, J., Van Gool, L.: Real Time Head Pose Estimation from Consumer Depth Cameras. In: Mester, R., Felsberg, M. (eds.) DAGM 2011. LNCS, vol. 6835, pp. 101–110. Springer, Heidelberg (2011)
Chapter Google Scholar
Breiman, L.: Random Forests. Machine Learning 45, 5–32 (2001)
Article MATH Google Scholar
Guo, W., Kotsia, I., Patras, I.: Tensor Learning for Regression. IEEE Transactions on Image Processing 21, 816–827 (2012)
Article MathSciNet Google Scholar
Breitenstein, M.D., Kuettel, D., Weise, T., Van Gool, L., Pfister, H.: Real-time face pose estimation from single range images. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2008, pp. 1–8 (2008)
Google Scholar
Kolda, T.G., Bader, B.W.: Tensor Decompositions and Applications. SIAM Review 51, 455–500 (2009)
Article MathSciNet MATH Google Scholar
Seemann, E., Nickel, K., Stiefelhagen, R.: Head pose estimation using stereo vision for human-robot interaction. In: Proceedings of the Sixth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 626–631 (2004)
Google Scholar
Morency, L.P., Sundberg, P., Darrell, T.: Pose estimation using 3D view-based eigenspaces. In: IEEE International Workshop on Analysis and Modeling of Faces and Gestures, AMFG 2003, pp. 45–52 (2003)
Google Scholar
Osadchy, M., Cun, Y.L., Miller, M.L.: Synergistic Face Detection and Pose Estimation with Energy-Based Models. J. Mach. Learn. Res. 8, 1197–1215 (2007)
Google Scholar
Vatahska, T., Bennewitz, M., Behnke, S.: Feature-based head pose estimation from images. In: 2007 7th IEEE-RAS International Conference on Humanoid Robots, pp. 330–335. IEEE (2007)
Google Scholar
Whitehill, J., Movellan, J.R.: A discriminative approach to frame-by-frame head pose tracking. In: 8th IEEE International Conference on Automatic Face Gesture Recognition, FG 2008, pp. 1–7 (2008)
Google Scholar
Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. IEEE Transactions on Pattern Analysis and Machine Intelligence 23, 681–685 (2001)
Article Google Scholar
Ramnath, K., Koterba, S., Xiao, J., Hu, C., Matthews, I., Baker, S., Cohn, J., Kanade, T.: Multi-view AAM fitting and construction. International Journal of Computer Vision 76, 183–204 (2008)
Article Google Scholar
Blanz, V., Vetter, T.: A morphable model for the synthesis of 3D faces. In: Proceedings of the 26th Annual Conference on Computer Graphics and Interactive Techniques, pp. 187–194. ACM Press/Addison-Wesley Publishing Co. (1999)
Google Scholar
Storer, M., Urschler, M., Bischof, H.: 3d-mam: 3d morphable appearance model for efficient fine head pose estimation from still images. In: 2009 IEEE 12th International Conference on Computer Vision Workshops, ICCV Workshops, pp. 192–199. IEEE (2009)
Google Scholar
Cristinacce, D., Cootes, T.: Feature detection and tracking with constrained local models, pp. 929–938 (2006)
Google Scholar
Murphy-Chutorian, E., Trivedi, M.M.: Head Pose Estimation and Augmented Reality Tracking: An Integrated System and Evaluation for Monitoring Driver Awareness. IEEE Transactions on Intelligent Transportation Systems 11, 300–311 (2010)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Queen Mary, University of London, UK
Sertan Kaymak & Ioannis Patras

Authors

Sertan Kaymak
View author publications
You can also search for this author in PubMed Google Scholar
Ioannis Patras
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science and Engineering, Hanyang University, 222 Wangshimni-ro, Seongdong-gu, 133-791, Seoul, South Korea
Jong-Il Park
Department of Electrical Engineering, KAIST, 291 Daehak-ro, Yuseong-gu, 305-701, Daejeon, South Korea
Junmo Kim

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kaymak, S., Patras, I. (2013). Exploiting Depth and Intensity Information for Head Pose Estimation with Random Forests and Tensor Models. In: Park, JI., Kim, J. (eds) Computer Vision - ACCV 2012 Workshops. ACCV 2012. Lecture Notes in Computer Science, vol 7729. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37484-5_14

Download citation

DOI: https://doi.org/10.1007/978-3-642-37484-5_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37483-8
Online ISBN: 978-3-642-37484-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics