Abstract
In this paper, a chunk-based multi-strategy machine translation method is proposed. Firstly, an English-Chinese bilingual tree-bank is constructed. Then, a translation strategy based on the chunk that combines statistics and rules is used in the translation stage. Through hierarchical sub-chunks, the input sentence is divided into a set of chunk sequence. Each chunk searches the corresponding instance in the corpus. Translation is completed by recursive refinement from chunks to words. Conditional Random Fields model is used to divide chunks. An experimental English-Chinese translation system is deployed, and experimental results show that the system performs better than the Systran system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Babhulgaonkar, A.R., Bharad, S.V.: Statistical machine translation. In: 1st International Conference on Intelligent Systems and Information Management, pp. 62–67. Institute of Electrical and Electronics Engineers Inc. (2017)
Gong, H.: The role of speech recognition and machine translation in interpreting. Study Lang. Arts Sports 5, 383–385 (2018)
Semmar, N., Laib, M.: Building multiword expressions bilingual lexicons for domain adaptation of an example-based machine translation system. In: 11th International Conference on Recent Advances in Natural Language Processing, pp. 661–669. Association for Computational Linguistics (2017)
Chua, C.C., Lim, T.Y., Soon, L.: Meaning preservation in example-based machine translation with structural semantics. Expert Syst. Appl. 78, 242–258 (2017)
Mahata, S.K., Das, D., Bandyopadhyay, S.: MTIL2017: machine translation using recurrent neural network on statistical machine translation. J. Intell. Syst. (2018)
Wang, X., Lu, Z., Tu, Z., et al.: Neural machine translation advised by statistical machine translation. In: 31st AAAI Conference on Artificial Intelligence, pp. 3330–3336. AAAI press (2017)
Sun, L., Jin, Y., Du, L., Sun, Y.: Automatic extraction of bilingual term lexicon from parallel corpora. J. Chin. Inform. Process. 14(6), 33–39 (2000)
Branco, A., Carvalheiro, C., Costa, F., et al.: DeepBankPT and companion Portuguese treebanks in a multilingual collection of treebanks aligned with the penn Treebank. In: 11th International Conference on Computational Processing of Portuguese, pp. 207–213. Springer (2014)
Badmaeva, E., Tyers, F.M.: A dependency treebank for Buryat. In: 17th International Conference on Intelligent Text Processing and Computational Linguistics, pp. 397–408. Springer (2018)
Bielinskiene, A., Boizou, L., Kovalevskaite, J., Rimkute, E.: Lithuanian dependency treebank ALKSNIS. In: 7th International Conference on Human Language Technologies - The Baltic Perspective, pp. 107–114. IOS Press (2016)
Song, D., Liu, W., Zhou, T., et al.: Efficient robust conditional random fields. IEEE Trans. Image Process. 24(10), 3124–3136 (2015)
BLEU-WIKIPEDIA. https://en.wikipedia.org/wiki/BLEU. Accessed 12 Feb 2018
Acknowledgment
The authors are very grateful to Special Projects for Reform and Development of Beijing Institute of Science and Technology Information (2018) (Information rapid processing capacity building with applied artificial intelligence and big data technology) for the supports and assistance.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Wang, Y., Zhang, F. (2019). A Chunk-Based Multi-strategy Machine Translation Method. In: Krömer, P., Zhang, H., Liang, Y., Pan, JS. (eds) Proceedings of the Fifth Euro-China Conference on Intelligent Data Analysis and Applications. ECC 2018. Advances in Intelligent Systems and Computing, vol 891. Springer, Cham. https://doi.org/10.1007/978-3-030-03766-6_38
Download citation
DOI: https://doi.org/10.1007/978-3-030-03766-6_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03765-9
Online ISBN: 978-3-030-03766-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)