Abstract
HowNet is a Chinese-English bilingual common-sense knowledge base that plays an important role in machine translation. For domain-specific machine translation, however, HowNet must be supplemented with domain-specific terminology; in other words, a domain terminology semantic knowledge base has to be constructed. In this paper, we propose a method that automatically constructs such a knowledge base from the headword of each term. Specifically, the semantic meaning (HowNet DEF) of an unseen term is defined as one of the semantic meanings of the term's headword. The headword is disambiguated by considering its context and by adding domain-specific disambiguation rules to the general disambiguation rules. Experiments in the aviation domain show that the proposed headword disambiguation method achieves a 9.4% improvement over the default disambiguation tools in HowNet. We also find that the HowNet machine translation system achieves better translation quality with our automatically constructed domain terminology knowledge base.
This work is supported by the Youth Growth Foundation of School (No. 20141502/215108) and the National Natural Science Foundation of China (No. 61402299).
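The headword-based assignment summarized in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several things here are assumptions made only for illustration: the headword is taken to be the last token of a term, the sense labels are invented placeholders rather than real HowNet DEF entries, a rule is modeled as a single context cue word, and domain rules are tried before general rules. The names `SENSES`, `Rule`, `headword`, and `assign_def` are hypothetical.

```python
from dataclasses import dataclass

# Toy sense inventory: headword -> candidate DEF labels.
# These labels are invented placeholders, NOT real HowNet DEF entries.
SENSES = {
    "wing": ["DEF:part-of-bird", "DEF:part-of-aircraft", "DEF:part-of-building"],
    "engine": ["DEF:machine"],
}


@dataclass(frozen=True)
class Rule:
    headword: str  # headword the rule applies to
    cue: str       # context word that triggers the rule
    sense: str     # DEF label selected when the cue is present


# Hypothetical rule sets; the domain rules supplement the general ones.
GENERAL_RULES = [
    Rule("wing", "bird", "DEF:part-of-bird"),
    Rule("wing", "hospital", "DEF:part-of-building"),
]
DOMAIN_RULES = [
    Rule("wing", "swept", "DEF:part-of-aircraft"),
]


def headword(term_tokens):
    """Assumption: the headword is the last token of the term."""
    return term_tokens[-1]


def assign_def(term_tokens, context_tokens,
               domain_rules=DOMAIN_RULES, general_rules=GENERAL_RULES):
    """Give an unseen term one of the DEF labels of its headword."""
    head = headword(term_tokens)
    candidates = SENSES.get(head)
    if not candidates:
        return None  # headword unknown: no DEF can be inherited
    if len(candidates) == 1:
        return candidates[0]  # unambiguous headword
    # Context = modifiers inside the term plus surrounding words.
    context = set(term_tokens[:-1]) | set(context_tokens)
    # Assumption: domain rules take priority over general rules.
    for rule in [*domain_rules, *general_rules]:
        if (rule.headword == head and rule.cue in context
                and rule.sense in candidates):
            return rule.sense
    return candidates[0]  # fallback: first listed sense


print(assign_def(["swept", "wing"], []))
print(assign_def(["turbofan", "engine"], []))
```

In this sketch the term "swept wing" inherits the aircraft sense because the domain rule fires on the modifier "swept", while "turbofan engine" needs no disambiguation because its headword has a single candidate sense.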
Copyright information
© 2016 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wu, C., Wang, L., Ye, N., Zhang, G., Cai, D. (2016). Automatic Construction of Domain Terminology Knowledge Base for HowNet Based on the Headword. In: Yang, M., Liu, S. (eds) Machine Translation. CWMT 2016. Communications in Computer and Information Science, vol 668. Springer, Singapore. https://doi.org/10.1007/978-981-10-3635-4_6
DOI: https://doi.org/10.1007/978-981-10-3635-4_6
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3634-7
Online ISBN: 978-981-10-3635-4
eBook Packages: Computer Science, Computer Science (R0)