User-Invariant Facial Animation with Convolutional Neural Network

Wang, Shuiquan; Cheng, Zhengxin; Chang, Liang; Qiao, Xuejun; Duan, Fuqing

doi:10.1007/978-3-030-04167-0_25

Shuiquan Wang¹⁶,
Zhengxin Cheng¹⁶,
Liang Chang¹⁶,
Xuejun Qiao¹⁷ &
…
Fuqing Duan¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 11301))

Included in the following conference series:

International Conference on Neural Information Processing

3671 Accesses

Abstract

In this paper, we propose a robust approach for real-time user-invariant and performance-based face animation system using a single ordinary RGB camera with convolutional neural network (CNN), where the facial expression coefficients are used to drive the avatar. Existing shape regression algorithms usually take a two-step procedure to estimate facial expressions: The first is to estimate the 3D positions of facial landmarks, and the second is computing the head poses and expression coefficients. The proposed method directly regresses the face expression coefficients by using CNN. This single-shot regressor for facial expression coefficients is faster than the state-of-the-art single web camera based face animation system. Moreover, our method can avoid the user-specific 3D blendshapes, and thus it is user-invariant. Three different input size CNN architectures are designed and combined with Smoothed L1 and Gaussian loss functions to regress the expression coefficients. Experiments validate the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Cao, C., Weng, Y., Lin, S.: 3D shape regression for real-time facial animation. ACM Trans. Graph. 32(4), 96 (2013)
Article Google Scholar
Huang, H., Chai, J., Tong, X.: Leveraging motion capture and 3D scanning for high-fidelity facial performance acquisition. ACM Trans. Graph. 30(4), 76–79 (2011)
Article Google Scholar
Zhang, L., Snavely, N., Curless, B.: Spacetime faces: high resolution capture for modeling and animation. ACM Trans. Graph. 23(3), 546–556 (2008)
Google Scholar
Bradley, D., Heidrich, W., Popa, T.: High resolution passive facial performance capture. ACM Trans. Graph. 29(4), 157–166 (2010)
Article Google Scholar
Weise, T., Bouaziz, S., Li, H.: Realtime performance-based facial animation. ACM Trans. Graph. 30(4), 76–79 (2011)
Article Google Scholar
Sauer, P., Cootes, T., Taylor, C.: Accurate regression procedures for active appearance models. In: BMVC, vol. 1 no. 6, pp. 681–685 (2011)
Google Scholar
Cao, C., Weng, Y., Zhou, S.: FaceWarehouse: a 3D facial expression database for visual computing. IEEE Trans. Visual Comput. Graphics 20(3), 413–425 (2014)
Article Google Scholar
Zhang, K., Zhang, Z., Li, Z.: Joint face detection and alignment using multitask cascaded convolutional networks. IEEE Sig. Process. Lett. 23(10), 1499–1503 (2016)
Article Google Scholar
Kang, B.N., Kim, Y., Kim, D.: Deep convolution neural network with stacks of multi-scale convolutional layer block using triplet of faces for face recognition in the wild. In: IEEE International Conference on Systems, Man, and Cybernetics, pp. 4460–4465 (2017)
Google Scholar
Levi, G., Hassncer, T.: Age and gender classification using convolutional neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition Workshops, pp. 34–42 (2015)
Google Scholar
Ranjan, R., Sankaranarayanan, S., Castillo, C.D.: An all-in-one convolutional neural network for face analysis. In: 12th IEEE International Conference on Automatic Face and Gesture Recognition, pp. 17–24(2017)
Google Scholar
Xiong, X., Torre, F.D.L.: Supervised descent method and its applications to face alignment. In: IEEE Conference on Computer Vision and Pattern Recognition, vol. 9, no. 4, pp. 532–539 (2013)
Google Scholar
Ranjan, R., Zhou, S., Chen, J.C.: Unconstrained age estimation with deep convolutional neural networks. In: IEEE International Conference on Computer Vision Workshop, pp. 351–359 (2015)
Google Scholar
Ekman, P., Friesen, W.V.: Facial action coding system: a technique for the measurement of facial movement. Rivista Di Psichiatria 47(2), 126–38 (1978)
Google Scholar
Weng, Y., Cao, C., Hou, Q.: Real-time facial animation on mobile devices. Graph. Models 76(3), 172–179 (2013)
Article Google Scholar
Redmon, R.: Darknet: open source neural networks in C. http://pjreddie.com/darknet/ (2013–2016)
Wu, Y., Hassner, T., Kim, K., et al.: Facial landmark detection with tweaked convolutional neural networks. IEEE Trans. Pattern Anal. Mach. Intell. 99, 1 (2015)
Google Scholar
Girshick, R.: Fast R-CNN. In: IEEE International Conference on Computer Vision, pp. 1440–1448 (2015)
Google Scholar

Download references

Acknowledgments

This work was supported by the National Natural Science Foundation of China under Grant No. 61572078.

Author information

Authors and Affiliations

College of Information Science and Technology, Beijing Normal University, Beijing, 100875, China
Shuiquan Wang, Zhengxin Cheng, Liang Chang & Fuqing Duan
School of Science, Xi’an University of Architecture and Technology, Xi’an, 710055, China
Xuejun Qiao

Authors

Shuiquan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhengxin Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Liang Chang
View author publications
You can also search for this author in PubMed Google Scholar
Xuejun Qiao
View author publications
You can also search for this author in PubMed Google Scholar
Fuqing Duan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fuqing Duan .

Editor information

Editors and Affiliations

The Chinese Academy of Sciences, Beijing, China
Long Cheng
City University of Hong Kong, Kowloon, Hong Kong
Andrew Chi Sing Leung
Kobe University, Kobe, Japan
Seiichi Ozawa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, S., Cheng, Z., Chang, L., Qiao, X., Duan, F. (2018). User-Invariant Facial Animation with Convolutional Neural Network. In: Cheng, L., Leung, A., Ozawa, S. (eds) Neural Information Processing. ICONIP 2018. Lecture Notes in Computer Science(), vol 11301. Springer, Cham. https://doi.org/10.1007/978-3-030-04167-0_25

Download citation

DOI: https://doi.org/10.1007/978-3-030-04167-0_25
Published: 17 November 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-04166-3
Online ISBN: 978-3-030-04167-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics