Large margin deep embedding for aesthetic image classification

  • Guanjun Guo
  • Hanzi Wang (Email author)
  • Yan Yan
  • Liming Zhang
  • Bo Li


We present a large margin deep embedding (LMDE) method with a novel network structure and an effective joint loss function that combines the triplet loss and the hinge loss. Minimizing the joint loss reduces the intra-class variability of features belonging to the same class and increases the inter-class separability of features from different classes. In our experiments, the proposed LMDE method significantly outperforms several state-of-the-art aesthetic classification methods in terms of classification accuracy.
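To make the idea of a joint loss concrete, the following is a minimal sketch of how a triplet term and a hinge term might be combined for one training triplet. It is not the formulation used in the paper: the squared Euclidean distance, the simple weighted sum, and the names `margin` and `weight` are all assumptions chosen for illustration.

```python
def squared_distance(u, v):
    """Squared Euclidean distance between two equal-length feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def triplet_loss(anchor, positive, negative, margin=1.0):
    """Penalize triplets where the positive is not closer to the anchor
    than the negative by at least `margin` (hypothetical default)."""
    d_pos = squared_distance(anchor, positive)
    d_neg = squared_distance(anchor, negative)
    return max(0.0, d_pos - d_neg + margin)


def hinge_loss(score, label):
    """Binary hinge loss; `label` is +1 (e.g. high aesthetic quality) or -1."""
    return max(0.0, 1.0 - label * score)


def joint_loss(anchor, positive, negative, score, label,
               margin=1.0, weight=1.0):
    """Assumed combination: triplet term plus a weighted hinge term.
    The weighting scheme is an illustration, not the paper's definition."""
    return (triplet_loss(anchor, positive, negative, margin)
            + weight * hinge_loss(score, label))


# Example: the negative is far from the anchor, so only the hinge term is active.
loss = joint_loss([0.0, 0.0], [0.0, 1.0], [3.0, 0.0], score=0.5, label=1)
```

In this sketch the triplet term pulls same-class features together and pushes different-class features apart, while the hinge term enforces a margin on the classifier score, which matches the qualitative behavior described above.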



This work was supported by National Natural Science Foundation of China (Grant Nos. U1605252, 61872307, 61472334, 61571379), National Key R&D Program of China (Grant No. 2017YFB1302400), and UM Multi-Year Research (Grant No. MYRG2017-00218-FST).

Supplementary material

11432_2018_9567_MOESM1_ESM.pdf (697 kb)



Copyright information

© Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature 2019

Authors and Affiliations

  • Guanjun Guo (1)
  • Hanzi Wang (1, Email author)
  • Yan Yan (1)
  • Liming Zhang (2)
  • Bo Li (3)

  1. Fujian Key Laboratory of Sensing and Computing for Smart City, School of Information Science and Engineering, Xiamen University, Xiamen, China
  2. Faculty of Science and Technology, University of Macau, Macau, China
  3. Beijing Key Laboratory of Digital Media, School of Computer Science and Engineering, Beihang University, Beijing, China
