Overview of Deep Learning in Facial Recognition

Implementations and Applications of Machine Learning

Part of the book series: Studies in Computational Intelligence ((SCI,volume 782))

Abstract

In recent years, a wide range of techniques has been developed to achieve accurate automated facial recognition. Since the success of AlexNet, a model based on convolutional neural networks (CNNs), in the 2012 ImageNet competition, deep learning algorithms for object detection and recognition have improved significantly in performance. This success has inspired the implementation of similar models in facial recognition, resulting in vastly improved performance. As a result, most recent research has been based on this paradigm. This chapter presents an overview of currently available deep neural network models for facial recognition. We outline the architectures and methods used by the best current models, and discuss performance issues related to the loss function, the optimization method, and the choice of training dataset.

Author information

Corresponding author

Correspondence to Arnauld Fountsop Nzegha.

Copyright information

© 2020 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Nzegha, A.F., Fendji, J.L.E., Thron, C., Tayou, C.D. (2020). Overview of Deep Learning in Facial Recognition. In: Subair, S., Thron, C. (eds) Implementations and Applications of Machine Learning. Studies in Computational Intelligence, vol 782. Springer, Cham. https://doi.org/10.1007/978-3-030-37830-1_6

  • DOI: https://doi.org/10.1007/978-3-030-37830-1_6

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-37829-5

  • Online ISBN: 978-3-030-37830-1

  • eBook Packages: Engineering, Engineering (R0)
