Medical Knowledge Attention Enhanced Neural Model for Named Entity Recognition in Chinese EMR
Named entity recognition (NER) in Chinese electronic medical records (EMRs) has become an important task of clinical natural language processing (NLP). However, limited studies have been performed on the clinical NER study in Chinese EMRs. Furthermore, when end-to-end neural network models have improved clinical NER performance, medical knowledge dictionaries such as various disease association dictionaries, which provide rich information of medical entities and relations among them, are rarely utilized in NER model. In this study, we investigate the problem of NER in Chinese EMRs and propose a clinical neural network NER model enhanced with medical knowledge attention by combining the entity mention information contained in external medical knowledge bases with EMR context together. Experimental results on the manually labeled dataset demonstrated that the proposed method can achieve better performance than the previous methods in most cases.
KeywordsChinese electronic medical record Named entity recognition Deep learning Knowledge attention
We would like to thank the anonymous reviewers for their valuable comments. The research work is supported by the National Natural Science Foundation of China (No. 61762081, No. 61662067) and the Key Research and Development Project of Gansu Province (No. 17YF1GA016).
- Carlson, A., Betteridge, J., Wang, R.C., et al.: Coupled semi-supervised learning for information extraction. DBLP, pp. 101–110 (2010)Google Scholar
- Chabchoub, M., Gagnon, M., Zouaq, A.: Collective disambiguation and semantic annotation for entity linking and typing. In: Sack, H., Dietze, S., Tordai, A., Lange, C. (eds.) SemWebEval 2016. CCIS, vol. 641, pp. 33–47. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46565-4_3CrossRefGoogle Scholar
- Chang, F.-X., Guo, J., Xu, W.-R., Chung, S.-R.: Application of word embeddings in biomedical named entity recognition tasks. J. Digit. Inf. Manag. 13(5), 321–327 (2015)Google Scholar
- Dong, X., Chowdhury, S., Qian, L., et al.: Transfer bi-directional LSTM RNN for named entity recognition in Chinese electronic medical records. In: The Proceedings of International Conference on E-Health Networking, Applications and Services, pp. 1–4. IEEE (2017)Google Scholar
- Le, H.-Q., Nguyen, T., Vu, S., Dang, T.-H.: D3NER: biomedical named entity recognition using CRF-biLSTM improved with fine-tuned embeddings of various linguistic information. Bioinformatics (2018). https://doi.org/10.1093/bioinformatics/bty356
- Li, L., Jin, L., Jiang, Y., Huang, D.: Recognizing biomedical named entities based on the sentence vector/twin word embeddings conditioned bidirectional LSTM. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds.) CCL/NLP-NABD-2016. LNCS (LNAI), vol. 10035, pp. 165–176. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-47674-2_15CrossRefGoogle Scholar
- Liu, Y., Liu, K., Xu, L.-H. Zhao, J.: Exploring fine-grained entity type constraints for distantly supervised relation extraction. In: Proceedings of COLING 2014, Dublin, Ireland, 23–29 August (2014)Google Scholar
- Ma, X., Hovy, E.: End-to-end sequence labeling via bi-directional LSTM-CNNs-CRF (2016). https://arxiv.org/pdf/1603.01354
- Wang, S., Li, S., Chen, T.: Recognition of Chinese medicine named entity based on condition random field. J Xiamen Univ. (Nat. Sci.) 48, 349–364 (2009)Google Scholar
- Wang, Y., Liu, Y., Yu, Z., et al.: A preliminary work on symptom name recognition from free-text clinical records of traditional Chinese medicine using conditional random fields and reasonable features. In: Proceedings of the 2012 Workshop on Biomedical Natural Language Processing, Stroudsburg, PA, USA, pp. 223–30 (2012)Google Scholar
- Ye, F., Chen, Y.Y., Zhou, G.G., et al.: Intelligent recognition of named entity in electronic medical records. Chin. J. Biomed. Eng. 30(2), 256–262 (2011)Google Scholar