The Research on Automatic Acquirement of the Domain Terms

  • Liangliang Liu
  • Haitao Wang
  • Jing Zhao
  • Fan Zhang
  • Chao Zhao
  • Gang Wu
  • Xinyu CaoEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 965)


There are different features in domain terms on different domain. In this paper, we took TCM clinical symptom terms as example to discuss the acquirement of domain terms due to the particularity and complexity in clinical symptom terms. We analyze the feature of TCM clinical symptom terms, and define the formal representation of the word-formation. Then we use the term in the TCM Clinical Terminology as seed terms, and generate word-formation rule base. We recognize the new TCM clinical symptom terms in the medical records based on the word-formation rule base. Then we verify the recognized terms with statistical method to implement the automatic recognition of TCM clinical symptom terms, as the basis of data analysis and data application in the further.


Automatic acquirement Knowledge ontology Domain terms TCM clinical symptom terms 



This paper is supported by grants from National Key R&D Program of China (2018YFF0213901) and China National Institute of Standardization(522016Y-4681).


  1. 1.
    Beatrice, D., Eric, G., Jean, M.L.: Towards automatic extraction of monolingual and bilingual terminology. In: Proceedings of the 15th conference on Computational Linguistics, Japan, pp. 515–521 (1994)Google Scholar
  2. 2.
    Church, K., Hanks, K.: Word Association Norms, Mutual Information and Lexicography. In: Proceedings of the 27th Annual Meeting on Association for Computational Linguistics, Vancouver, British Columbia, Canada, pp. 76–83 (1989)Google Scholar
  3. 3.
    Frantzi, K., Ananiadou, S.: The C-value/NC-value domain independent method for multi-word term extraction. J. Nat. Lang. Process. 6(3), 145–179 (1999)CrossRefGoogle Scholar
  4. 4.
    Hongbing, X.: Structural features and distribution of Chinese-English terms in the corpus from information field. Inf. Technol. Appl. 1, 22–25 (2000)Google Scholar
  5. 5.
    Du, B., Tian, H., Wang, L., et al.: Design of domain-specific term extractor based on multi-strategy. Comput. Eng. 31(14), 159–160 (2005)Google Scholar
  6. 6.
    Chen, W., Zhu, J.: Automatic learning field words by bootstrapping. In: The Proceedings of the Seventh National Joint Conference on Computational Linguistics, pp. 67–72. Tsinghua University Press, Beijing (2003)Google Scholar
  7. 7.
    Olsson, F., Eriksson, G., Franzen, K., et al.: Notions of correctness when evaluating protein name taggers. In: Proceedings of the 19th International Conference on Computational Linguistics, pp. 765–771 (2002)Google Scholar
  8. 8.
    Wang, S., Li, S., Chen, T.: Recognition of Chinese medicine named entity based on condition random field. J. Xiamen Univ. (Nat. Sci.) 48(3), 359–364 (2009)Google Scholar
  9. 9.
    Zhao, X.: On the Research of TCM Knowledge Discovery System Based on Web Ming. Beijing Jiaotong University, Beijing (2010)Google Scholar
  10. 10.
    Zhang, W., Bai, Y., Wang, P., et al.: An automatic domain terms extractor method on traditional Chinese medicine books. J. Shenyang Aerosp. Univ. 28(1), 72–75 (2011)Google Scholar
  11. 11.
    Hongyu, M.: Automatic identification of TCM terminology in Shanghan Lun based on condition random field. J. Beijing Univ. Tradit. Chin. Med. (2014)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • Liangliang Liu
    • 1
  • Haitao Wang
    • 2
  • Jing Zhao
    • 2
  • Fan Zhang
    • 2
  • Chao Zhao
    • 2
  • Gang Wu
    • 2
  • Xinyu Cao
    • 2
    Email author
  1. 1.Shanghai School of Statistics and InformationShanghai University of International Business and EconomicsShanghaiChina
  2. 2.Beijing China National Institute of StandardizationBeijingChina

Personalised recommendations