Using Semantic Structure to Improve Chinese-English Term Translation

Zhang, Guiping; Liu, Ruiqian; Ye, Na; Huang, Haihong

doi:10.1007/978-3-319-12277-9_17

Guiping Zhang²¹,
Ruiqian Liu²¹,
Na Ye²¹ &
…
Haihong Huang²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8801))

Included in the following conference series:

1575 Accesses

Abstract

This paper introduces a method which aims at translating Chinese terms into English. Our motivation is providing deep semantic-level information for term translation through analyzing the semantic structure of terms. Using the contextual information in the term and the first sememe of each word in HowNet as features, we trained a Support Vector Machine (SVM) model to identify the dependencies among words in a term. Then a Conditional Random Field (CRF) model is trained to mark semantic relations for term dependencies. During translation, the semantic relations within the Chinese terms are identified and three features based on semantic structure are integrated into the phrase-based statistical machine translation system. Experimental results show that the proposed method achieves 1.58 BLEU points improvement in comparison with the baseline system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Cao, Y., Li, H.: Base noun phrase translation using web data and the EM algorithm. In: Proceedings of the 19th International Conference on Computational Linguistics, vol. 1, pp. 1–7. Association for Computational Linguistics (2002)
Google Scholar
Fang, G., Yu, H., Nishino, F.: Chinese-English term translation mining based on semantic prediction. In: Proceedings of the COLING/ACL on Main Conference Poster Sessions, pp. 199–206. Association for Computational Linguistics (2006)
Google Scholar
Wang, J., Zhang, G., Ye, N., Zhou, L.: Research on Japanese-Chinese Term Translation Technique Based on Multi-Features. In: Chinese Conference on Pattern Recognition, CCPR 2009, pp. 1–5. IEEE (2009)
Google Scholar
Kang, B.K., Chen, Y.R., Chang, B.B., Yu, S.W.: Translating multi word terms into Korean from Chinese documents. In: Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2005, pp. 449–454. IEEE (2005)
Google Scholar
Wu, X., Okazaki, N., Tsunakawa, T., Tsujii, J.I.: Improving English-to-Chinese translation for technical terms using morphological information. In: AMTA 2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, pp. 202–211 (2008)
Google Scholar
Tsuji, K.: Automatic extraction of translational Japanese-KATAKANA and English word pairs from bilingual corpora. International Journal of Computer Processing of Oriental Languages 15(03), 261–279 (2002)
Article Google Scholar
Xiao, T., Zhu, J., Zhang, H., Li, Q.: NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation. In: Proceedings of the ACL 2012 System Demonstrations, pp. 19–24. Association for Computational Linguistics (2012)
Google Scholar
Beale, S., Nirenburg, S., Mahesh, K.: Semantic analysis in the Mikrokosmos machine translation project. In: Proceedings of the 2nd Symposium on Natural Language Processing, pp. 297–307 (1995)
Google Scholar
Dong, Z., Dong, Q.: HowNet and the Computation of Meaning, pp. 1–316. World Scientific, Singapore (2006)
Book Google Scholar
Liu, Q., Li, S.: Word similarity computing based on How-net. Computational Linguistics and Chinese Language Processing 7(2), 59–76 (2002)
Google Scholar
Och, F.J., Ney, H.: Discriminative training and maximum entropy models for statistical machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 295–302. Association for Computational Linguistics (2002)
Google Scholar

Download references

Author information

Authors and Affiliations

Knowledge Engineering Research Center, Shenyang Aerospace University, Shenyang, China
Guiping Zhang, Ruiqian Liu & Na Ye
Chinese COMAC Shanghai Aircraft Design and Research Institute, Shanghai, China
Haihong Huang

Authors

Guiping Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Ruiqian Liu
View author publications
You can also search for this author in PubMed Google Scholar
Na Ye
View author publications
You can also search for this author in PubMed Google Scholar
Haihong Huang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Haidian District, 100084, Beijing, China
Maosong Sun & Yang Liu &
Chinese Academy of Sciences, Institute of Automation, 100190, Beijing, China
Jun Zhao

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Zhang, G., Liu, R., Ye, N., Huang, H. (2014). Using Semantic Structure to Improve Chinese-English Term Translation. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_17

Download citation

DOI: https://doi.org/10.1007/978-3-319-12277-9_17
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-12276-2
Online ISBN: 978-3-319-12277-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics