Abstract
The number of medical research papers and clinical notes on disease diagnosis, treatment and prevention is increasing every day. This biomedical text is a rich source of knowledge for biomedical research. However, the information is scattered across a vast medical informatics literature in unstructured form. It is necessary to retrieve the important information from these publications in order to discover new knowledge. Much research has been done in biomedical text mining using different methods and techniques. The i2b2 center has organized several challenges on natural language processing for medical text. The i2b2 2010 challenge tasks focused on concept extraction, assertion classification and relation extraction, and the 2012 task was temporal information extraction. Previous work has found machine learning techniques to be effective for extracting clinical information from different types of medical data, such as discharge summaries and physical notes. This paper reviews earlier work on machine learning techniques and methods for medical research. The effectiveness of these techniques is measured by precision, recall and F-score. This review will help biomedical researchers identify the best techniques for further research in clinical information extraction.
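As an illustration of the evaluation measures named in the abstract, the following minimal sketch computes precision, recall and F-score for a concept extraction system. It assumes exact-match scoring over (start, end, type) spans and the balanced F1 variant of the F-score; the function name, the span representation and the toy data are hypothetical and are not taken from the chapter or from the i2b2 evaluation scripts.

```python
from typing import Set, Tuple

# Hypothetical span representation: (start token, end token, concept type).
Span = Tuple[int, int, str]


def precision_recall_f1(gold: Set[Span], predicted: Set[Span]) -> Tuple[float, float, float]:
    """Exact-match precision, recall and balanced F-score (F1)."""
    # A prediction counts as correct only if boundaries and type both match.
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    denominator = precision + recall
    f1 = 2 * precision * recall / denominator if denominator else 0.0
    return precision, recall, f1


# Toy annotations (invented for illustration only).
gold = {(0, 2, "problem"), (5, 6, "treatment"), (9, 11, "test"), (14, 15, "problem")}
predicted = {(0, 2, "problem"), (5, 6, "test"), (9, 11, "test"), (20, 21, "problem")}

precision, recall, f1 = precision_recall_f1(gold, predicted)
print(f"precision={precision:.2f} recall={recall:.2f} f1={f1:.2f}")
```

In this toy case two of the four predicted spans match the gold annotations exactly, so a span with the right boundaries but the wrong concept type counts as an error under this strict matching assumption.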
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Patel, R., Tanwani, S. (2019). Application of Machine Learning Techniques in Clinical Information Extraction. In: Mishra, M., Mishra, B., Patel, Y., Misra, R. (eds) Smart Techniques for a Smarter Planet. Studies in Fuzziness and Soft Computing, vol 374. Springer, Cham. https://doi.org/10.1007/978-3-030-03131-2_8
DOI: https://doi.org/10.1007/978-3-030-03131-2_8
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03130-5
Online ISBN: 978-3-030-03131-2
eBook Packages: Intelligent Technologies and Robotics (R0)