Facial Landmarks Detector Learned by the Structured Output SVM

Uřičář, Michal; Franc, Vojtěch; Hlaváč, Václav

doi:10.1007/978-3-642-38241-3_26

Michal Uřičář⁶,
Vojtěch Franc⁶ &
Václav Hlaváč⁶

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 359))

3051 Accesses
16 Citations

Abstract

We propose a principled approach to supervised learning of facial landmarks detector based on the Deformable Part Models (DPM). We treat the task of landmarks detection as an instance of the structured output classification. To learn the parameters of the detector we use the Structured Output Support Vector Machines algorithm. The objective function of the learning algorithm is directly related to the performance of the detector and controlled by the user-defined loss function, in contrast to the previous works. Our proposed detector is real-time on a standard computer, simple to implement and easily modifiable for detection of various set of landmarks. We evaluate the performance of our detector on a challenging “Labeled Faces in the Wild” (LFW) database. The empirical results show that our detector consistently outperforms two public domain implementations based on the Active Appearance Models and the DPM. We are releasing open-source code implementing our proposed detector along with the manual annotation of seven facial landmarks for nearly all images in the LFW database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Beumer, G., Veldhuis, R.: On the accuracy of EERs in face recognition and the importance of reliable registration. In: 5th IEEE Benelux Signal Processing Symposium (SPS 2005), pp. 85–88. IEEE Benelux Signal Processing (2005)
Google Scholar
Cristinacce, D., Cootes, T., Scott, I.: A multi-stage approach to facial feature detection. In: 15th British Machine Vision Conference (BMVC 2004), pp. 277–286 (2004)
Google Scholar
Riopka, T., Boult, T.: The eyes have it. In: Proceedings of ACM SIGMM Multimedia Biometrics Methods and Applications Workshop, pp. 9–16 (2003)
Google Scholar
Cootes, T.F., Edwards, G.J., Taylor, C.J.: Active appearance models. IEEE Trans. Pattern Analysis and Machine Intelligence 23, 681–685 (2001)
Article Google Scholar
Viola, P., Jones, M.: Robust real-time face detection. International Journal of Computer Vision 57, 137–154 (2004)
Article Google Scholar
Beumer, G., Tao, Q., Bazen, A., Veldhuis, R.: A landmark paper in face recognition. In: 7th International Conference on Automatic Face and Gesture Recognition (FGR 2006). IEEE Computer Society Press (2006)
Google Scholar
Cristinacce, D., Cootes, T.: Facial feature detection using AdaBoost with shape constraints. In: 14th Proceedings British Machine Vision Conference (BMVC 2003), pp. 231–240 (2003)
Google Scholar
Erukhimov, V., Lee, K.: A bottom-up framework for robust facial feature detection. In: 8th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2008), pp. 1–6 (2008)
Google Scholar
Wu, J., Trivedi, M.: Robust facial landmark detection for intelligent vehicle system. In: IEEE International Workshop on Analysis and Modeling of Faces and Gestures (2005)
Google Scholar
Crandall, D., Felzenszwalb, P., Huttenlocher, D.: Spatial priors for part-based recognition using statistical models. In: CVPR, pp. 10–17 (2005)
Google Scholar
Felzenszwalb, P.F., Huttenlocher, D.P.: Pictorial structures for object recognition. Internatinal Journal of Computer Vision 61, 55–79 (2005)
Article Google Scholar
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part based models. IEEE Transactions on Pattern Analysis and Machine Intelligence 99 (2009)
Google Scholar
Fischler, M.A., Elschlager, R.A.: The representation and matching of pictorial structures. IEEE Transactions on Computers C-22, 67–92 (1973)
Article Google Scholar
Everingham, M., Sivic, J., Zisserman, A.: “Hello! My name is.. Buffy” – automatic naming of characters in TV video. In: Proceedings of the British Machine Vision Conference (2006)
Google Scholar
Everingham, M., Sivic, J., Zisserman, A.: Taking the bite out of automatic naming of characters in TV video. Image and Vision Computing 27 (2009)
Google Scholar
Sivic, J., Everingham, M., Zisserman, A.: “Who are you?” – learning person specific classifiers from video. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (2009)
Google Scholar
Tsochantaridis, I., Joachims, T., Hofmann, T., Altun, Y., Singer, Y.: Large margin methods for structured and interdependent output variables. Journal of Machine Learning Research 6, 1453–1484 (2005)
MATH Google Scholar
Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Technical Report 07-49, University of Massachusetts, Amherst (2007)
Google Scholar
Teo, C.H., Vishwanthan, S., Smola, A.J., Le, Q.V.: Bundle methods for regularized risk minimization. J. Mach. Learn. Res. 11, 311–365 (2010)
MathSciNet MATH Google Scholar
Bordes, A., Bottou, L., Gallinari, P.: SGD-QN: Careful quasi-newton stochastic gradient descent. Journal of Machine Learning Research 10, 1737–1754 (2009)
MathSciNet MATH Google Scholar
Heikkilä, M., Pietikäinen, M., Schmid, C.: Description of interest regions with local binary patterns. Pattern Recognition 42, 425–436 (2009)
Article MATH Google Scholar
Franc, V., Sonnenburg, S.: LIBOCAS — library implementing OCAS solver for training linear svm classifiers from large-scale data (2010), http://cmp.felk.cvut.cz/~xfrancv/ocas/html/index.html
Kroon, D.J.: Active shape model (ASM) and active appearance model (AAM). MATLAB Central (2010), http://www.mathworks.com/matlabcentral/fileexchange/26706-active-shape-model-asm-and-active-appearance-model-aam
Nordstrøm, M.M., Larsen, M., Sierakowski, J., Stegmann, M.B.: The IMM face database - an annotated dataset of 240 face images. Technical report, Informatics and Mathematical Modelling, Technical University of Denmark, DTU (2004)
Google Scholar
Everingham, M., Sivic, J., Zisserman, A.: Willow project, automatic naming of characters in tv video. MATLAB implementation (2008), http://www.robots.ox.ac.uk/~vgg/research/nface/index.html

Download references

Author information

Authors and Affiliations

Center for Machine Perception, Department of Cybernetics, Faculty of Electrical Engineering, Czech Technical University in Prague, Technická 2, 166 27, Prague 6, Czech Republic
Michal Uřičář, Vojtěch Franc & Václav Hlaváč

Authors

Michal Uřičář
View author publications
You can also search for this author in PubMed Google Scholar
Vojtěch Franc
View author publications
You can also search for this author in PubMed Google Scholar
Václav Hlaváč
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Xerox Research Centre Europe, 6 ch. de Maupertuis, 38240, Meylan, France
Gabriela Csurka
Aalborg University, Niels Jernes Vej 14, 3-215A, 9220, Aalborg Ø, Denmark
Martin Kraus
Department of Computer Science, University of Wales, Swansea, UK
Robert S. Laramee
LISA-ISTIA, Université d’Angers, 62, avenue Notre-Dame du Lac, 49000, Angers, France
Paul Richard
Departamento de Sistemas e Informática, Escola Superior de Tecnologia do IPS, Rua do Vale de Chaves Estefanilha, 2910, Setúbal, Portugal
José Braz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Uřičář, M., Franc, V., Hlaváč, V. (2013). Facial Landmarks Detector Learned by the Structured Output SVM. In: Csurka, G., Kraus, M., Laramee, R.S., Richard, P., Braz, J. (eds) Computer Vision, Imaging and Computer Graphics. Theory and Application. Communications in Computer and Information Science, vol 359. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38241-3_26

Download citation

DOI: https://doi.org/10.1007/978-3-642-38241-3_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38240-6
Online ISBN: 978-3-642-38241-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics