The Research on Automatic Acquirement of the Domain Terms
There are different features in domain terms on different domain. In this paper, we took TCM clinical symptom terms as example to discuss the acquirement of domain terms due to the particularity and complexity in clinical symptom terms. We analyze the feature of TCM clinical symptom terms, and define the formal representation of the word-formation. Then we use the term in the TCM Clinical Terminology as seed terms, and generate word-formation rule base. We recognize the new TCM clinical symptom terms in the medical records based on the word-formation rule base. Then we verify the recognized terms with statistical method to implement the automatic recognition of TCM clinical symptom terms, as the basis of data analysis and data application in the further.
KeywordsAutomatic acquirement Knowledge ontology Domain terms TCM clinical symptom terms
This paper is supported by grants from National Key R&D Program of China (2018YFF0213901) and China National Institute of Standardization(522016Y-4681).
- 1.Beatrice, D., Eric, G., Jean, M.L.: Towards automatic extraction of monolingual and bilingual terminology. In: Proceedings of the 15th conference on Computational Linguistics, Japan, pp. 515–521 (1994)Google Scholar
- 2.Church, K., Hanks, K.: Word Association Norms, Mutual Information and Lexicography. In: Proceedings of the 27th Annual Meeting on Association for Computational Linguistics, Vancouver, British Columbia, Canada, pp. 76–83 (1989)Google Scholar
- 4.Hongbing, X.: Structural features and distribution of Chinese-English terms in the corpus from information field. Inf. Technol. Appl. 1, 22–25 (2000)Google Scholar
- 5.Du, B., Tian, H., Wang, L., et al.: Design of domain-specific term extractor based on multi-strategy. Comput. Eng. 31(14), 159–160 (2005)Google Scholar
- 6.Chen, W., Zhu, J.: Automatic learning field words by bootstrapping. In: The Proceedings of the Seventh National Joint Conference on Computational Linguistics, pp. 67–72. Tsinghua University Press, Beijing (2003)Google Scholar
- 7.Olsson, F., Eriksson, G., Franzen, K., et al.: Notions of correctness when evaluating protein name taggers. In: Proceedings of the 19th International Conference on Computational Linguistics, pp. 765–771 (2002)Google Scholar
- 8.Wang, S., Li, S., Chen, T.: Recognition of Chinese medicine named entity based on condition random field. J. Xiamen Univ. (Nat. Sci.) 48(3), 359–364 (2009)Google Scholar
- 9.Zhao, X.: On the Research of TCM Knowledge Discovery System Based on Web Ming. Beijing Jiaotong University, Beijing (2010)Google Scholar
- 10.Zhang, W., Bai, Y., Wang, P., et al.: An automatic domain terms extractor method on traditional Chinese medicine books. J. Shenyang Aerosp. Univ. 28(1), 72–75 (2011)Google Scholar
- 11.Hongyu, M.: Automatic identification of TCM terminology in Shanghan Lun based on condition random field. J. Beijing Univ. Tradit. Chin. Med. (2014)Google Scholar