A Proposed Language Model Based on LSTM

Zhang, Yumeng; Lu, Xuanmin; Quan, Bei; Wei, Yuanyuan

doi:10.1007/978-3-030-14657-3_35

Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 271)

Included in the following conference series:

International Conference on Internet of Things as a Service

Abstract

To address the shortcomings of the N-gram language model, this paper presents a language model based on Long Short-Term Memory (LSTM), an improved RNN architecture that can, in theory, exploit information from arbitrarily long sequences. Experimental results show that the perplexity of the LSTM language model on the PBT corpus is only one-half that of the N-gram language model.
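The abstract compares the two models by perplexity. As a reminder of what that metric measures, the following is a minimal sketch in Python of the standard definition (the exponential of the mean negative log-probability per token). The function name `perplexity` and the per-token probabilities are hypothetical values chosen for illustration only; they are not taken from the paper or its corpus.

```python
import math


def perplexity(token_probs):
    """Return exp of the mean negative log-probability per token."""
    if not token_probs:
        raise ValueError("need at least one token probability")
    mean_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(mean_nll)


# Hypothetical probabilities that two models assign to the same 4-token text.
# These numbers are illustrative assumptions, not results from the paper.
ngram_probs = [0.05, 0.05, 0.05, 0.05]
lstm_probs = [0.10, 0.10, 0.10, 0.10]

ppl_ngram = perplexity(ngram_probs)
ppl_lstm = perplexity(lstm_probs)
print(f"N-gram perplexity: {ppl_ngram:.2f}")
print(f"LSTM perplexity:   {ppl_lstm:.2f}")
```

In this toy case the second model assigns twice the probability to every token, so its perplexity is half that of the first. A halving of perplexity, as reported in the abstract, corresponds to this kind of doubling of the geometric-mean per-token probability.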

Author information

Authors and Affiliations

Northwestern Polytechnical University, Xi’an, China
Yumeng Zhang, Xuanmin Lu, Bei Quan & Yuanyuan Wei

Corresponding author

Correspondence to Yumeng Zhang.

Editor information

Editors and Affiliations

Northwestern Polytechnical University, Xi'an, China
Bo Li
Northwestern Polytechnical University, Xi'an, China
Mao Yang
Shandong University, Jinan, Shandong, China
Hui Yuan
Northwestern Polytechnical University, Xi'an, Shaanxi, China
Zhongjiang Yan

About this paper

Cite this paper

Zhang, Y., Lu, X., Quan, B., Wei, Y. (2019). A Proposed Language Model Based on LSTM. In: Li, B., Yang, M., Yuan, H., Yan, Z. (eds) IoT as a Service. IoTaaS 2018. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 271. Springer, Cham. https://doi.org/10.1007/978-3-030-14657-3_35

DOI: https://doi.org/10.1007/978-3-030-14657-3_35
Published: 07 March 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-14656-6
Online ISBN: 978-3-030-14657-3
eBook Packages: Computer Science, Computer Science (R0)
