The Usage of Grayscale or Color Images for Facial Expression Recognition with Deep Neural Networks

Yudin, Dmitry A.; Dolzhenko, Alexandr V.; Kapustina, Ekaterina O.

doi:10.1007/978-3-030-30425-6_32

Part of the book series: Studies in Computational Intelligence ((SCI,volume 856))

Included in the following conference series:

International Conference on Neuroinformatics

964 Accesses
1 Citations

Abstract

The paper describes usage of modern deep neural network architectures such as ResNet, DenseNet and Xception for the classification of facial expressions on color and grayscale images. Each image may contain one of eight facial expression categories: “Neutral”, “Happiness”, “Sadness”, “Surprise”, “Fear”, “Disgust”, “Anger”, “Contempt”. As the dataset was used AffectNet. The most accurate architecture is Xception. It gave classification accuracy on training sample 97.65%, on cleaned testing sample 57.48% and top-2 accuracy on cleaned testing sample 76.70%. The category “Contempt” is worst recognized by all the types of neural networks considered, which indicates its ambiguity and similarity with other types of facial expressions. Experimental results show that for the considered task it does not matter, the color or grayscale image is fed to the input of the algorithm. This fact can save a significant amount of memory when storing data sets and training neural networks. The computing experiments was performed using graphics processor using NVidia CUDA technology with Keras and Tensorflow deep learning frameworks. It showed that the average processing time of one image varies from 4 ms to 30 ms for different architectures. Obtained results can be used in software for neural network training for face recognition systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Hardcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Zeng, N., Zhang, H., Song, B., Liu, W., Li, Y., Dobaie, A.M.: Facial expression recognition via learning deep sparse autoencoders. Neurocomputing 273, 643–649 (2018)
Article Google Scholar
Yudin, D., Knysh, A.: Vehicle recognition and its trajectory registration on the image sequence using deep convolutional neural network. In: The International Conference on Information and Digital Technologies, pp. 435–441 (2017)
Google Scholar
Yudin, D., Naumov, A., Dolzhenko, A., Patrakova, E.: Software for roof defects recognition on aerial photographs. J. Phys. Conf. Ser. 1015(3), 032152 (2018)
Article Google Scholar
Friesen, W., Ekman, P.: EMFACS-7: emotional facial action coding system. Unpublished Manuscript Univ. Calif. San Francisco 2(36), 1 (1983)
Google Scholar
Lyons, M.J., Akemastu, S., Kamachi, M., Gyoba, J.: Coding facial expressions with gabor wavelets. In: 3rd IEEE International Conference on Automatic Face and Gesture Recognition, pp. 200–205 (1998)
Google Scholar
Shan, C., Gong, S., McOwan, P.W.: Facial expression recognition based on local binary patterns: a comprehensive study. Image Vis. Comput. 27(6), 803–816 (2009)
Article Google Scholar
Kotsia, I., Pitas, I.: Facial expression recognition in image sequences using geometric deformation features and support vector machines. IEEE Trans. Image Process. 16(1), 172–187 (2006)
Article MathSciNet Google Scholar
Wang, J., Yin, L., Wei, X., Sun, Y.: 3D facial expression recognition based on primitive surface feature distribution. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2006) (2006)
Google Scholar
Lopes, A.T., de Aguiar, E., De Souza, A.F., Oliveira-Santos, T.: Facial expression recognition with convolutional neural networks: coping with few data and the training sample order. Pattern Recogn. 61, 610–628 (2017)
Article Google Scholar
Mollahosseini, A., Chan, D., Mahoor, M.H.: Going deeper in facial expression recognition using deep neural networks. In: IEEE Winter Conference on Applications of Computer Vision (WACV) (2016)
Google Scholar
Ding, H., Zhou, S.K., Chellappa, R.: FaceNet2ExpNet: regularizing a deep face recognition net for expression recognition. In: 12th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2017) (2017)
Google Scholar
Zhang, K., Huang, Y., Du, Y., Wang, L.: Facial expression recognition based on deep evolutional spatial-temporal networks. IEEE Trans. Image Process. 26(9), 4193–4203 (2017)
Article MathSciNet Google Scholar
Zhang, T., Zheng, W., Cui, Z., Zong, Y., Yan, J., Yan, K.: A deep neural network-driven feature learning method for multi-view facial expression recognition. IEEE Trans. Multimedia 18(12), 2528–2536 (2016)
Article Google Scholar
Face API oт Microsoft Azure. https://azure.microsoft.com/ru-ru/services/cognitive-services/face/#detection. Accessed 26 May 2019
Amazon Emotion API. https://docs.aws.amazon.com/rekognition/latest/dg/API_Emotion.html. Accessed 26 May 2019
Affectiva Emotion SDK. https://www.affectiva.com/product/emotion-sdk/. Accessed 26 May 2019
Lucey, P., Cohn, J.F., Kanade, T., Saragih, J., Ambadar, Z., Matthews, I.: The extended cohn-kanade dataset (CK+): a complete expression dataset for action unit and emotion-specified expression. In: Proceedings of the Third International Workshop on CVPR for Human Communicative Behavior Analysis (CVPR4HB 2010), pp. 94–101 (2010)
Google Scholar
Carrier, P.-L., Courville, A.: Challenges in representation learning: facial expression recognition challenge (2013). https://www.kaggle.com/c/challenges-in-representation-learning-facial-expression-recognition-challenge/data. Accessed 26 May 2019
Facial expressions. A set of images for classifying facial expressions. https://github.com/muxspace/facial_expressions. Accessed 26 May 2019
Afifi, M., Abdelhamed, A.: AFIF4: deep gender classification based on an AdaBoost-based fusion of isolated facial features and foggy faces. J. Vis. Commun. Image Represent. 62, 77–86 (2019)
Article Google Scholar
Mollahosseini, A., Hasani, B., Mahoor, M.H.: AffectNet: a database for facial expression, valence, and arousal computing in the wild. IEEE Trans. Affect. Comput. 10(1), 18–31 (2017)
Article Google Scholar
Olson, D.L., Delen, D.: Advanced Data Mining Techniques, 1st edn. Springer, Cham (2008)
MATH Google Scholar
Kaiming, H., Xiangyu, Z., Shaoqing, R., Jian, S.: Deep residual learning for image recognition. ECCV. arXiv:1512.03385 (2015)
Yudin, D., Kapustina, E.: Deep learning in vehicle pose recognition on two-dimensional images. Adv. Intell. Syst. Comput. 874, 434–443 (2019)
Google Scholar
Huang, G., Liu, Z., van der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. CVPR 2017. arXiv:1608.06993 (2017)
Chollet, F.: Xception: deep learning with depthwise separable convolutions. CVPR 2017. arXiv:1610.02357 (2017)
Szegedy, C., Vanhoucke, V., Ioffe, S., Shlens, J., Wojna, Z.: Rethinking the inception architecture for computer vision. ECCV. arXiv:1512.00567 (2016)
Chollet, F.: Keras: deep learning library for theano and tensorflow. https://keras.io/. Accessed 26 May 2019

Download references

Acknowledgment

The research was made possible by Government of the Russian Federation (Agreement №. 075-02-2019-967).

Author information

Authors and Affiliations

Moscow Institute of Physics and Technology (National Research University), Institutsky Per. 9, Dolgoprudny, Moscow Region, 141700, Russia
Dmitry A. Yudin
Belgorod State Technological University named after V.G. Shukhov, Kostukova Str. 46, Belgorod, 308012, Russia
Alexandr V. Dolzhenko & Ekaterina O. Kapustina

Authors

Dmitry A. Yudin
View author publications
You can also search for this author in PubMed Google Scholar
Alexandr V. Dolzhenko
View author publications
You can also search for this author in PubMed Google Scholar
Ekaterina O. Kapustina
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Dmitry A. Yudin .

Editor information

Editors and Affiliations

Scientific Research Institute for System Analysis of Russian Academy of Sciences, Moscow, Russia
Boris Kryzhanovsky
Scientific Research Institute for System Analysis of Russian Academy of Sciences, Moscow, Russia
Witali Dunin-Barkowski
Scientific Research Institute for System Analysis of Russian Academy of Sciences, Moscow, Russia
Vladimir Redko
Moscow Aviation Institute (National Research University), Moscow, Russia
Yury Tiumentsev

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Yudin, D.A., Dolzhenko, A.V., Kapustina, E.O. (2020). The Usage of Grayscale or Color Images for Facial Expression Recognition with Deep Neural Networks. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research III. NEUROINFORMATICS 2019. Studies in Computational Intelligence, vol 856. Springer, Cham. https://doi.org/10.1007/978-3-030-30425-6_32

Download citation

DOI: https://doi.org/10.1007/978-3-030-30425-6_32
Published: 04 September 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-30424-9
Online ISBN: 978-3-030-30425-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics