
Post-training Quantization of Deep Neural Network Weights

  • Conference paper
  • First Online:
Advances in Neural Computation, Machine Learning, and Cognitive Research III (NEUROINFORMATICS 2019)

Part of the book series: Studies in Computational Intelligence (SCI, volume 856)

Included in the conference series: NEUROINFORMATICS

Abstract

The paper considers weight quantization as a tool for reducing the size of an already trained neural network without retraining. We examine methods based on uniform and exponential weight quantization and compare the results. In addition, we demonstrate the quantization algorithm on three neural networks: VGG16, VGG19 and ResNet50.
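
For orientation (the full text is behind the access wall below), here is a minimal sketch of the two schemes the abstract names, uniform and exponential (logarithmic) weight quantization, applied post-training to a single weight array. The bit width, the clipping range taken from the observed min/max, and the sign handling are illustrative assumptions, not the authors' exact procedure.

import numpy as np

def uniform_quantize(w, bits=8):
    """Snap weights to 2**bits evenly spaced levels spanning [min(w), max(w)]."""
    lo, hi = float(w.min()), float(w.max())
    step = (hi - lo) / (2 ** bits - 1)
    codes = np.round((w - lo) / step)      # integer code per weight
    return codes * step + lo               # dequantized approximation

def exponential_quantize(w, bits=8):
    """Snap |w| to a logarithmic (base-2) grid and restore the sign."""
    sign = np.sign(w)
    mag = np.abs(w)
    mag = np.where(mag == 0, np.finfo(np.float32).tiny, mag)  # avoid log(0)
    log_lo, log_hi = np.log2(mag.min()), np.log2(mag.max())
    step = (log_hi - log_lo) / (2 ** bits - 1)
    codes = np.round((np.log2(mag) - log_lo) / step)
    return sign * np.exp2(codes * step + log_lo)

# Toy check on a random "layer": compare the reconstruction error of both schemes.
w = (0.05 * np.random.randn(10000)).astype(np.float32)
for name, fn in [("uniform", uniform_quantize), ("exponential", exponential_quantize)]:
    w_hat = fn(w, bits=4)
    print(f"{name:12s} mean abs error: {np.abs(w - w_hat).mean():.6f}")

Because trained weight distributions are usually concentrated near zero, a logarithmic grid places more levels where most weights lie, which is the usual motivation for comparing it against a uniform grid at low bit widths.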



Author information

Corresponding author

Correspondence to M. Yu. Malsagov.


Ethics declarations

This work was financially supported by the State Program of SRISA RAS No. 0065-2019-0003 (AAA-A19-119011590090-2).


Copyright information

© 2020 Springer Nature Switzerland AG

About this paper


Cite this paper

Khayrov, E.M., Malsagov, M.Y., Karandashev, I.M. (2020). Post-training Quantization of Deep Neural Network Weights. In: Kryzhanovsky, B., Dunin-Barkowski, W., Redko, V., Tiumentsev, Y. (eds) Advances in Neural Computation, Machine Learning, and Cognitive Research III. NEUROINFORMATICS 2019. Studies in Computational Intelligence, vol 856. Springer, Cham. https://doi.org/10.1007/978-3-030-30425-6_27
