Application of Machine Learning Techniques in Clinical Information Extraction

Patel, Ruchi; Tanwani, Sanjay

doi:10.1007/978-3-030-03131-2_8

Part of the book series: Studies in Fuzziness and Soft Computing (STUDFUZZ, volume 374)

Abstract

The number of medical research papers and clinical notes on disease diagnosis, treatment and prevention grows every day. This biomedical text is a rich source of knowledge for biomedical research, but the information is scattered across a vast medical informatics literature in unstructured form. Important information therefore has to be retrieved from these publications before new knowledge can be discovered. Biomedical text mining has been studied extensively with a range of methods and techniques. The i2b2 center has organized several challenges on natural language processing for medical text: the 2010 challenge focused on concept extraction, assertion classification and relation extraction, and the 2012 challenge on temporal information extraction. Previous work has found machine learning techniques to be effective for extracting clinical information from different types of medical data, such as discharge summaries and physical notes. This paper reviews earlier work on machine learning techniques and methods for medical research. The effectiveness of these techniques is measured by precision, recall and F-score. The review should help biomedical researchers identify the best techniques for further research in clinical information extraction.
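The three evaluation measures named in the abstract can be illustrated with a minimal sketch. It assumes exact-match scoring over extracted concept spans, where each span is a hypothetical (start token, end token, concept type) tuple; the function name, the span representation and the toy data are illustrative assumptions and are not taken from the chapter or from the i2b2 evaluation scripts.

```python
from typing import Set, Tuple

# Hypothetical span representation: (start_token, end_token, concept_type).
Span = Tuple[int, int, str]


def precision_recall_f1(gold: Set[Span], predicted: Set[Span]) -> Tuple[float, float, float]:
    """Exact-match precision, recall and balanced F-score (F1) over two span sets."""
    true_positives = len(gold & predicted)  # spans matching in boundaries and type
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0.0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)  # harmonic mean
    return precision, recall, f1


# Toy data (invented for illustration): four gold concepts, four system outputs.
gold = {(0, 1, "problem"), (5, 6, "treatment"), (9, 9, "test"), (12, 14, "problem")}
predicted = {(0, 1, "problem"), (5, 6, "test"), (9, 9, "test"), (20, 21, "treatment")}

p, r, f = precision_recall_f1(gold, predicted)
print(f"precision={p:.2f} recall={r:.2f} F1={f:.2f}")
```

In this toy case only two predicted spans match a gold span in both boundaries and type, so precision, recall and F1 all equal 0.50. A span with correct boundaries but the wrong type, such as (5, 6, "test"), counts as an error under exact matching; a more lenient scheme that credits overlapping spans would score it differently.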

Author information

Authors and Affiliations

School of Computer Science and IT, DAVV, Indore, 452001, Madhya Pradesh, India
Ruchi Patel & Sanjay Tanwani

Corresponding author

Correspondence to Ruchi Patel.

Editor information

Editors and Affiliations

School of Computer Engineering, KIIT University, Bhubaneswar, Odisha, India
Manoj Kumar Mishra
School of Computer Engineering, KIIT University, Bhubaneswar, Odisha, India
Bhabani Shankar Prasad Mishra
Department of Computer Science and Engineering, IIT Patna, Patna, Bihar, India
Yashwant Singh Patel
Department of Computer Science and Engineering, IIT Patna, Patna, Bihar, India
Rajiv Misra

About this chapter

Cite this chapter

Patel, R., Tanwani, S. (2019). Application of Machine Learning Techniques in Clinical Information Extraction. In: Mishra, M., Mishra, B., Patel, Y., Misra, R. (eds) Smart Techniques for a Smarter Planet. Studies in Fuzziness and Soft Computing, vol 374. Springer, Cham. https://doi.org/10.1007/978-3-030-03131-2_8

DOI: https://doi.org/10.1007/978-3-030-03131-2_8
Published: 30 January 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-03130-5
Online ISBN: 978-3-030-03131-2
eBook Packages: Intelligent Technologies and Robotics, Intelligent Technologies and Robotics (R0)
