Abstract
Long Short-Term Memory (LSTM) network based language models are state-of-the-art techniques in natural language processing. Training LSTM networks is computationally intensive, which motivates FPGA acceleration using fixed-point arithmetic. However, previous studies have considered accelerators only at a few fixed bit-widths, without a thorough accuracy evaluation. The main contribution of this paper is a comprehensive experimental evaluation of the effect of bit-width on the LSTM based language model and on the tanh function approximation. Among the configurations evaluated, a 12-bit number with a 6-bit fractional part is the best choice, balancing accuracy against storage savings. To achieve performance similar to the software implementation while fitting the bit-widths of FPGA primitives, we further propose a mixed bit-width solution combining 8-bit and 16-bit numbers. By making the accuracy trade-offs explicit, our results provide a guide to the bit-width design choices when implementing LSTMs on FPGAs. Additionally, our experiments indicate that the scale of the LSTM network does not affect the optimal fixed-point configuration, which suggests that our results are applicable to larger models as well.
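The two ingredients of the evaluation, Q-format quantization and a hardware-friendly tanh approximation, can be sketched in a few lines. The following Python snippet is a minimal illustration under assumed conventions, not the authors' implementation: it models a signed two's-complement fixed-point number with a given total width and fractional width (e.g. the 12-bit/6-fraction format above), and approximates tanh with linear segments that interpolate the true function at hand-picked breakpoints; the function names and the breakpoint choices are hypothetical.

```python
import math

def quantize(x, width=12, frac=6):
    """Round x onto a signed two's-complement fixed-point grid with
    `frac` fractional bits, saturate to the representable range, and
    return the result as a float (for software simulation)."""
    scale = 1 << frac
    lo = -(1 << (width - 1))        # most negative code
    hi = (1 << (width - 1)) - 1     # most positive code
    code = max(lo, min(hi, round(x * scale)))
    return code / scale

# Piecewise-linear tanh: segments interpolate tanh() at a few
# breakpoints, saturating to +/-1 beyond the last one. This is one
# common FPGA-friendly scheme; the paper's exact segmentation may differ.
_BREAKS = [0.0, 0.5, 1.0, 2.0, 3.0]
_VALUES = [math.tanh(b) for b in _BREAKS]

def tanh_pwl(x):
    s, ax = math.copysign(1.0, x), abs(x)
    if ax >= _BREAKS[-1]:
        return s * 1.0              # saturation region
    for i in range(len(_BREAKS) - 1):
        if ax < _BREAKS[i + 1]:
            x0, x1 = _BREAKS[i], _BREAKS[i + 1]
            y0, y1 = _VALUES[i], _VALUES[i + 1]
            return s * (y0 + (y1 - y0) * (ax - x0) / (x1 - x0))

if __name__ == "__main__":
    for x in (-2.7, -0.3, 0.1, 1.4):
        q = quantize(x)             # 12-bit value, 6 fractional bits
        print(f"x={x:+.2f}  q={q:+.6f}  "
              f"tanh={math.tanh(x):+.4f}  pwl={tanh_pwl(q):+.4f}")
```

Sweeping `width` and `frac` over a trained model's weights and activations, and measuring the resulting perplexity, would reproduce the style of accuracy evaluation reported in the paper.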
Acknowledgement
This work was supported by the Natural Science Foundation of China under Grant No. 61303070.
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Jin, R., Jiang, J., Dou, Y. (2017). Accuracy Evaluation of Long Short Term Memory Network Based Language Model with Fixed-Point Arithmetic. In: Wong, S., Beck, A., Bertels, K., Carro, L. (eds) Applied Reconfigurable Computing. ARC 2017. Lecture Notes in Computer Science, vol 10216. Springer, Cham. https://doi.org/10.1007/978-3-319-56258-2_24
DOI: https://doi.org/10.1007/978-3-319-56258-2_24
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-56257-5
Online ISBN: 978-3-319-56258-2