Abstract
In this paper, we propose a new face landmark detection method. In previous methods, face detection was essential before a face landmark detection. The disadvantage of these methods is that they are greatly affected by the performance of the face detection model. In order to overcome this disadvantage, we proposed a method to simultaneously detect the face region and the face landmark. The basic idea came from 1-stage object detection. The structure of the Yolo v3 model, a representative 1-stage object detection model, was modified to find the landmark, and the loss function for training was modified to learn the coordinates of the landmark. In addition, MobileNet was used as the backbone network to increase the processing speed. In order to check the performance of the proposed model, the model was trained using the 300 W-LP database. It was then tested using Helen and LFPW databases, and the average normalized error was used as the evaluation metric. As a result of the evaluation, it was confirmed that the proposed model has improved performance over the previous methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahonen, T., Hadid, A., Pietikainen, M.: Face description with local binary patterns: application to face recognition. IEEE Trans. Pattern Anal. Mach. Intell. 28(12), 2037–2041 (2006)
Sun, Y., Wang, X., Tang, X.: Deep learning face representation from predicting 10,000 classes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 1891–1898 (2014)
Kemelmacher-Shlizerman, I., Basri, R.: 3D face reconstruction from a single image using a single reference face shape. IEEE Trans. Pattern Anal. Mach. Intell. 33(2), 394–405 (2010)
Saragih, J., Göcke, R.: Learning AAM fitting through simulation. Pattern Recogn. 42(11), 2628–2636 (2009)
Ren, S., Cao, X., Wei, Y., Sun, J.: Face alignment at 3000 fps via regressing local binary features. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 1685–1692 (2014)
Sagonas, C., Antonakos, E., Tzimiropoulos, G., Zafeiriou, S., Pantic, M.: 300 faces in-the-wild challenge: database and results. Image Vis. Comput. 47, 3–18 (2016)
Zhang, Z., Luo, P., Loy, C.C., Tang, X.: Facial landmark detection by deep multi-task learning. In: European conference on computer vision. pp. 94–108. Springer, Cham. (2014)
Redmon, J., Farhadi, A.: Yolov3: An incremental improvement. arXiv preprint arXiv:1804.02767 (2018)
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.Y., Berg, A. C.: Ssd: Single shot multibox detector. In European conference on computer vision. pp. 21–37. Springer, Cham (2016)
Howard, A.G., Zhu, M., Chen, B., Kalenichenko, D., Wang, W., Weyand, T., Adam, H.: Mobilenets: efficient convolutional neural networks for mobile vision applications. arXiv preprint arXiv:1704.04861 (2017)
Zhu, X., Lei, Z., Liu, X., Shi, H., Li, S. Z.: Face alignment across large poses: A 3d solution. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. pp. 146–155 (2016)
Zhou, E., Fan, H., Cao, Z., Jiang, Y., Yin, Q.: Extensive facial landmark localization with coarse-to-fine convolutional network cascade. In: Proceedings of the IEEE International Conference on Computer Vision Workshops pp. 386–391 (2013)
Belhumeur, P.N., Jacobs, D.W., Kriegman, D.J., Kumar, N.: Localizing parts of faces using a consensus of exemplars. IEEE Trans. Pattern Anal. Mach. Intell. 35(12), 2930–2940 (2013)
Zhu, X., Ramanan, D.: Face detection, pose estimation, and landmark localization in the wild. In: 2012 IEEE Conference On Computer Vision and Pattern Recognition. pp. 2879–2886. IEEE (2012)
Wu, Y., Ji, Q.: Facial landmark detection: a literature survey. Int. J. Comput. Vis. 127(2), 115–142 (2019)
Tan, M., Le, Q.V.: Efficientnet: Rethinking model scaling for convolutional neural networks. arXiv preprint arXiv:1905.11946 (2019)
Howard, A., Sandler, M., Chu, G., Chen, L.C., Chen, B., Tan, M., Le, Q.V.: Searching for mobilenetv3. In: Proceedings of the IEEE International Conference on Computer Vision pp. 1314–1324 (2019)
Acknowledgement
This work was supported by the NRF(National Research Foundation) of Korea funded by the Korea government (Ministry of Science and ICT) (NRF-2019R1A2C4070681).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Kim, T., Mok, J.W., Lee, E.C. (2021). 1-Stage Face Landmark Detection Using Deep Learning. In: Singh, M., Kang, DK., Lee, JH., Tiwary, U.S., Singh, D., Chung, WY. (eds) Intelligent Human Computer Interaction. IHCI 2020. Lecture Notes in Computer Science(), vol 12616. Springer, Cham. https://doi.org/10.1007/978-3-030-68452-5_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-68452-5_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-68451-8
Online ISBN: 978-3-030-68452-5
eBook Packages: Computer ScienceComputer Science (R0)