Skip to main content

Automatic Construction of Domain Terminology Knowledge Base for HowNet Based on the Headword

  • Conference paper
  • First Online:
Machine Translation (CWMT 2016)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 668))

Included in the following conference series:

  • 571 Accesses

Abstract

HowNet is a Chinese-English Bilingual common-sense knowledge base, playing an important role in machine translation tasks. However, when facing domain-specific machine translation tasks, HowNet must be supplemented with domain-specific terminologies. In other words, we need to construct domain terminology semantic knowledge base. In this paper, we propose a method to automatically construct domain terminology knowledge base, based on the headword of a terminology. Specifically, the semantic meaning (HowNet DEF) of an unseen terminology is defined as one of the semantic meanings of the headword of the terminology. Headword disambiguation is done by considering the context of headwords and adding domain-specific disambiguation rules to the general disambiguation rules. Experiments on aviation domain show that our proposed method on headword disambiguation achieves 9.4% improvement based on the default disambiguation tools in HowNet. We also find that with our automatically constructed domain terminology knowledge base, HowNet machine translation system achieves better translation quality.

This work is supported by the Youth Growth Foundation of School(№-20141502/215108) and National Natural Science Foundation of China (№-61402299).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Dong, Z., Dong, Q.: HowNet. http://www.keenage.com

  2. Wang, Y., Bai, Yu., Ding, C.: Construction of TCM theoretical knowledge base for semantic retrieval. J. Chin. Inf. Process. 26(5), 72–78 (2012)

    Google Scholar 

  3. Zhang, G., Diao, L., Diao, P.: Construction of aviation termiology semantic knowledge base based on HowNet. J. Chin. Inf. Process. 28(5), 92–101 (2014)

    Google Scholar 

  4. Liu, J., Tang, H., Tang, H.: Semantic knowledge base constructed from chinese online encyclopedia. J. Syst. Simul. 28(3), 542–548 (2016)

    Google Scholar 

  5. Cui, L., Chen, Q.C., Guo, H.Z., Wang, X.L.: Auto-extraction approach of sematic class attributes in the fusion of HowNet and Wikipedia. In: Advances of Computational Linguistics in China (2009)

    Google Scholar 

  6. Dong, Z., Dong, Q., Hao, C.: Semantic computing in HowNet MT system. In: CWMT 2014, pp. 45–54 (2014)

    Google Scholar 

  7. Dong, Z., Dong, Q.: HowNet and the Computation of Meaning. World Scientific, Singapore (2006)

    Book  Google Scholar 

  8. Dong, Z., Dong, Q.: HowNet and its computation of meaning. In: International Conference on Computational Linguistics: Demonstrations, vol. 88(8), pp. 301–306 (2010)

    Google Scholar 

  9. Dong, Z., Dong, Q., Hao, C.: Theoretical findings of HowNet. J. Chin. Inf. Process. 21(4), 3–9 (2007)

    MathSciNet  Google Scholar 

  10. Dong, Z., Dong, Q., Hao, C.: Sense colony testing in HowNet MT system. In: CWMT 2014, pp. 55–63 (2014)

    Google Scholar 

  11. Tang, G., Yu, D., Xun, E.: An unsupervised word sense disambiguation method based on the representation of sememe in HowNet. J. Chin. Inf. Process. 06 (2015)

    Google Scholar 

  12. Yang, Z.: Word sense disambiguation method based on knowledge context. J. Comput. Appl. 35(4), 1006–1008 (2015)

    MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lin Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer Nature Singapore Pte Ltd.

About this paper

Cite this paper

Wu, C., Wang, L., Ye, N., Zhang, G., Cai, D. (2016). Automatic Construction of Domain Terminology Knowledge Base for HowNet Based on the Headword. In: Yang, M., Liu, S. (eds) Machine Translation. CWMT 2016. Communications in Computer and Information Science, vol 668. Springer, Singapore. https://doi.org/10.1007/978-981-10-3635-4_6

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-3635-4_6

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-3634-7

  • Online ISBN: 978-981-10-3635-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics