1-Stage Face Landmark Detection Using Deep Learning

Kim, Taehyung; Mok, Ji Won; Lee, Eui Chul

doi:10.1007/978-3-030-68452-5_25

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 12616))

Included in the following conference series:

International Conference on Intelligent Human Computer Interaction

1110 Accesses

Abstract

In this paper, we propose a new face landmark detection method. In previous methods, face detection was essential before a face landmark detection. The disadvantage of these methods is that they are greatly affected by the performance of the face detection model. In order to overcome this disadvantage, we proposed a method to simultaneously detect the face region and the face landmark. The basic idea came from 1-stage object detection. The structure of the Yolo v3 model, a representative 1-stage object detection model, was modified to find the landmark, and the loss function for training was modified to learn the coordinates of the landmark. In addition, MobileNet was used as the backbone network to increase the processing speed. In order to check the performance of the proposed model, the model was trained using the 300 W-LP database. It was then tested using Helen and LFPW databases, and the average normalized error was used as the evaluation metric. As a result of the evaluation, it was confirmed that the proposed model has improved performance over the previous methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)
Article Google Scholar
Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 1891–1898 (2014)
Google Scholar
Kemelmacher-Shlizerman, I., Basri, R.: 3D face reconstruction from a single image using a single reference face shape. IEEE Trans. Pattern Anal. Mach. Intell. 33(2), 394–405 (2010)
Article Google Scholar
Saragih, J., Göcke, R.: Learning AAM fitting through simulation. Pattern Recogn. 42(11), 2628–2636 (2009)
Article Google Scholar
Ren, S., Cao, X., Wei, Y., Sun, J.: Face alignment at 3000 fps via regressing local binary features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 1685–1692 (2014)
Google Scholar
Sagonas, C., Antonakos, E., Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: 300 faces in-the-wild challenge: database and results. Image Vis. Comput. 47, 3–18 (2016)
Article Google Scholar
Zhang, Z., Luo, P., Loy, C.C., Tang, X.: Facial landmark detection by deep multi-task learning. In: European conference on computer vision. pp. 94–108. Springer, Cham. (2014)
Google Scholar
Redmon, J., Farhadi, A.: Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A. C.: Ssd: Single shot multibox detector. In European conference on computer vision. pp. 21–37. Springer, Cham (2016)
Google Scholar
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Adam, H.: Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)
Zhu, X., Lei, Z., Liu, X., Shi, H., Li, S. Z.: Face alignment across large poses: A 3d solution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 146–155 (2016)
Google Scholar
Zhou, E., Fan, H., Cao, Z., Jiang, Y., Yin, Q.: Extensive facial landmark localization with coarse-to-fine convolutional network cascade. In: Proceedings of the IEEE International Conference on Computer Vision Workshops pp. 386–391 (2013)
Google Scholar
Belhumeur, P.N., Jacobs, D.W., Kriegman, D.J., Kumar, N.: Localizing parts of faces using a consensus of exemplars. IEEE Trans. Pattern Anal. Mach. Intell. 35(12), 2930–2940 (2013)
Article Google Scholar
Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: 2012 IEEE Conference On Computer Vision and Pattern Recognition. pp. 2879–2886. IEEE (2012)
Google Scholar
Wu, Y., Ji, Q.: Facial landmark detection: a literature survey. Int. J. Comput. Vis. 127(2), 115–142 (2019)
Article Google Scholar
Tan, M., Le, Q.V.: Efficientnet: Rethinking model scaling for convolutional neural networks. arXiv preprint arXiv:1905.11946 (2019)
Howard, A., Sandler, M., Chu, G., Chen, L.C., Chen, B., Tan, M., Le, Q.V.: Searching for mobilenetv3. In: Proceedings of the IEEE International Conference on Computer Vision pp. 1314–1324 (2019)
Google Scholar

Download references

Acknowledgement

This work was supported by the NRF(National Research Foundation) of Korea funded by the Korea government (Ministry of Science and ICT) (NRF-2019R1A2C4070681).

Author information

Authors and Affiliations

Department of Artificial Intelligence and Informatics, Graduate School, Sangmyung University, Seoul, 03016, Republic of Korea
Taehyung Kim & Ji Won Mok
Department of Human-Centered Artificial Intelligence, Sangmyung University, Seoul, 03016, Republic of Korea
Eui Chul Lee

Authors

Taehyung Kim
View author publications
You can also search for this author in PubMed Google Scholar
Ji Won Mok
View author publications
You can also search for this author in PubMed Google Scholar
Eui Chul Lee
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Eui Chul Lee .

Editor information

Editors and Affiliations

Woosong University, Daejeon, Korea (Republic of)
Madhusudan Singh
Dongseo University, Busan, Korea (Republic of)
Dae-Ki Kang
Keimyung University, Daegu, Korea (Republic of)
Jong-Ha Lee
Indian Institute of Information Technology, Allahabad, India
Uma Shanker Tiwary
Hankuk University of Foreign Studies, Yongin, Korea (Republic of)
Dhananjay Singh
Pukyong National University, Busan, Korea (Republic of)
Wan-Young Chung

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, T., Mok, J.W., Lee, E.C. (2021). 1-Stage Face Landmark Detection Using Deep Learning. In: Singh, M., Kang, DK., Lee, JH., Tiwary, U.S., Singh, D., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2020. Lecture Notes in Computer Science(), vol 12616. Springer, Cham. https://doi.org/10.1007/978-3-030-68452-5_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-68452-5_25
Published: 06 February 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68451-8
Online ISBN: 978-3-030-68452-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics