DeepSIC: Deep Semantic Image Compression

  • Sihui Luo
  • Yezhou Yang
  • Yanling Yin
  • Chengchao Shen
  • Ya Zhao
  • Mingli SongEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11301)


Incorporating semantic analysis into image compression can significantly reduce the repetitive computation of fundamental semantic analysis in client-side applications such as semantic image retrieval. The same practice also enables the compressed code to carry semantic information of the image during its storage and transmission. In this paper, we propose a Deep Semantic Image Compression (DeepSIC) model to achieve this goal and put forward two novel architectures that aim to reconstruct the compressed image and generate corresponding semantic representations at the same time by a single end-to-end optimized network. The first architecture performs semantic analysis in the encoding process by reserving a portion of the bits from the compressed code to store the semantic representations. The second performs semantic analysis in the decoding step with the feature maps that are embedded in the compressed code. In both architectures, the feature maps are shared by the compression and the semantic analytics modules. Experiments over benchmarking datasets show promising performance of the proposed compression model.


Deep image compression Semantic image compression End-to-end optimization 



This work is supported by National Natural Science Foundation of China (61572428, U1509206), Fundamental Research Funds for the Central Universities (2017FZA5014), National Key Research and Development Program (2016YFB1200203) and Key Research and Development Program of Zhejiang Province (2018C01004).


  1. 1.
    Ballé, J., Laparra, V., Simoncelli, E.P.: End-to-end optimized image compression. arXiv preprint arXiv:1611.01704 (2016)
  2. 2.
    Franzen, R.: Kodak lossless true color image suite (1999).
  3. 3.
    Gregor, K., Besse, F., Rezende, D.J., Danihelka, I., Wierstra, D.: Towards conceptual compression. In: Advances in Neural Information Processing Systems, pp. 3549–3557 (2016)Google Scholar
  4. 4.
    He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)Google Scholar
  5. 5.
    Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 2261–2269 (2017)Google Scholar
  6. 6.
    Kingma, D., Ba, J.: Adam: a method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014)
  7. 7.
    Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)Google Scholar
  8. 8.
    Marpe, D., Schwarz, H., Wiegand, T.: Context-based adaptive binary arithmetic coding in the H. 264/AVC video compression standard. IEEE Trans. Circ. Syst. Video Technol. 13(7), 620–636 (2003)CrossRefGoogle Scholar
  9. 9.
    Rabbani, M., Joshi, R.: An overview of the JPEG 2000 still image compression standard. Sig. Process. Image Commun. 17(1), 3–48 (2002)CrossRefGoogle Scholar
  10. 10.
    Rippel, O., Bourdev, L.: Real-time adaptive image compression. In: Proceedings of the 34th International Conference on Machine Learning, vol. 70, pp. 2922–2930. PMLR (2017)Google Scholar
  11. 11.
    Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)
  12. 12.
    Szegedy, C., et al.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1–9 (2015)Google Scholar
  13. 13.
    Theis, L., Shi, W., Cunningham, A., Huszár, F.: Lossy image compression with compressive autoencoders. arXiv preprint arXiv:1703.00395 (2017)
  14. 14.
    Toderici, G., et al.: Variable rate image compression with recurrent neural networks. arXiv preprint arXiv:1511.06085 (2015)
  15. 15.
    Toderici, G., et al.: Full resolution image compression with recurrent neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 5435–5443 (2017)Google Scholar
  16. 16.
    Wallace, G.K.: The JPEG still picture compression standard. IEEE Trans. Consum. Electron. 38(1), xviii–xxxiv (1992)CrossRefGoogle Scholar
  17. 17.
    Wang, Z., Simoncelli, E.P., Bovik, A.C.: Multiscale structural similarity for image quality assessment. In: Thrity-Seventh Asilomar Conference on Signals, Systems Computers, vol. 2, pp. 1398–1402 (2003)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  • Sihui Luo
    • 1
  • Yezhou Yang
    • 2
  • Yanling Yin
    • 1
  • Chengchao Shen
    • 1
  • Ya Zhao
    • 1
  • Mingli Song
    • 1
    Email author
  1. 1.Zhejiang UniversityHangzhouChina
  2. 2.Arizona State UniversityTempeUSA

Personalised recommendations