Skip to main content

Abstract

This paper introduces a method which aims at translating Chinese terms into English. Our motivation is providing deep semantic-level information for term translation through analyzing the semantic structure of terms. Using the contextual information in the term and the first sememe of each word in HowNet as features, we trained a Support Vector Machine (SVM) model to identify the dependencies among words in a term. Then a Conditional Random Field (CRF) model is trained to mark semantic relations for term dependencies. During translation, the semantic relations within the Chinese terms are identified and three features based on semantic structure are integrated into the phrase-based statistical machine translation system. Experimental results show that the proposed method achieves 1.58 BLEU points improvement in comparison with the baseline system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cao, Y., Li, H.: Base noun phrase translation using web data and the EM algorithm. In: Proceedings of the 19th International Conference on Computational Linguistics, vol. 1, pp. 1–7. Association for Computational Linguistics (2002)

    Google Scholar 

  2. Fang, G., Yu, H., Nishino, F.: Chinese-English term translation mining based on semantic prediction. In: Proceedings of the COLING/ACL on Main Conference Poster Sessions, pp. 199–206. Association for Computational Linguistics (2006)

    Google Scholar 

  3. Wang, J., Zhang, G., Ye, N., Zhou, L.: Research on Japanese-Chinese Term Translation Technique Based on Multi-Features. In: Chinese Conference on Pattern Recognition, CCPR 2009, pp. 1–5. IEEE (2009)

    Google Scholar 

  4. Kang, B.K., Chen, Y.R., Chang, B.B., Yu, S.W.: Translating multi word terms into Korean from Chinese documents. In: Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering, IEEE NLP-KE 2005, pp. 449–454. IEEE (2005)

    Google Scholar 

  5. Wu, X., Okazaki, N., Tsunakawa, T., Tsujii, J.I.: Improving English-to-Chinese translation for technical terms using morphological information. In: AMTA 2008. MT at work: Proceedings of the Eighth Conference of the Association for Machine Translation in the Americas, pp. 202–211 (2008)

    Google Scholar 

  6. Tsuji, K.: Automatic extraction of translational Japanese-KATAKANA and English word pairs from bilingual corpora. International Journal of Computer Processing of Oriental Languages 15(03), 261–279 (2002)

    Article  Google Scholar 

  7. Xiao, T., Zhu, J., Zhang, H., Li, Q.: NiuTrans: an open source toolkit for phrase-based and syntax-based machine translation. In: Proceedings of the ACL 2012 System Demonstrations, pp. 19–24. Association for Computational Linguistics (2012)

    Google Scholar 

  8. Beale, S., Nirenburg, S., Mahesh, K.: Semantic analysis in the Mikrokosmos machine translation project. In: Proceedings of the 2nd Symposium on Natural Language Processing, pp. 297–307 (1995)

    Google Scholar 

  9. Dong, Z., Dong, Q.: HowNet and the Computation of Meaning, pp. 1–316. World Scientific, Singapore (2006)

    Book  Google Scholar 

  10. Liu, Q., Li, S.: Word similarity computing based on How-net. Computational Linguistics and Chinese Language Processing 7(2), 59–76 (2002)

    Google Scholar 

  11. Och, F.J., Ney, H.: Discriminative training and maximum entropy models for statistical machine translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 295–302. Association for Computational Linguistics (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Zhang, G., Liu, R., Ye, N., Huang, H. (2014). Using Semantic Structure to Improve Chinese-English Term Translation. In: Sun, M., Liu, Y., Zhao, J. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2014 2014. Lecture Notes in Computer Science(), vol 8801. Springer, Cham. https://doi.org/10.1007/978-3-319-12277-9_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-12277-9_17

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-12276-2

  • Online ISBN: 978-3-319-12277-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics