
Application of Machine Learning Techniques in Clinical Information Extraction

Smart Techniques for a Smarter Planet

Part of the book series: Studies in Fuzziness and Soft Computing (STUDFUZZ, volume 374)

Abstract

The number of medical research papers and clinical notes on disease diagnosis, treatment and prevention is increasing every day. This biomedical text is a rich source of knowledge for biomedical research. However, the information is scattered across a vast medical informatics literature in unstructured form, so it is necessary to retrieve the important information from these publications and discover new knowledge. Much research has been done in biomedical text mining using different methods and techniques. The i2b2 center organized several challenges on natural language processing for medical text. The i2b2 2010 challenge tasks focused on concept extraction, assertion classification and relation extraction, and the 2012 task was temporal information extraction. In previous work, machine learning techniques have been found to be effective for extracting clinical information from different types of medical data, such as discharge summaries and physical notes. This paper reviews earlier work on machine learning techniques and methods for medical research. The effectiveness of these techniques is measured by precision, recall and F-score. The review should help biomedical researchers identify the best techniques for further research in clinical information extraction.
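As a minimal sketch of the evaluation measures named in the abstract, the snippet below computes precision, recall and the balanced F-score (F1) for a concept extraction output. The exact-match criterion, the `(start, end, type)` span representation, the function name and the toy annotations are illustrative assumptions; they are not taken from the chapter or from the official i2b2 scoring scripts.

```python
def precision_recall_f1(gold, predicted):
    """Score predicted concept spans against gold spans by exact match.

    Each span is a hypothetical (start_token, end_token, concept_type) tuple.
    """
    gold_set = set(gold)
    pred_set = set(predicted)

    # True positives: spans whose boundaries and type both match the gold standard.
    true_positives = len(gold_set & pred_set)

    # Precision: fraction of predicted spans that are correct.
    precision = true_positives / len(pred_set) if pred_set else 0.0
    # Recall: fraction of gold spans that were found.
    recall = true_positives / len(gold_set) if gold_set else 0.0
    # F1: harmonic mean of precision and recall.
    if precision + recall > 0:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0
    return precision, recall, f1


# Toy annotations (invented for illustration only).
gold = {(0, 2, "problem"), (5, 6, "treatment"), (9, 10, "test")}
predicted = {(0, 2, "problem"), (5, 6, "test"), (9, 10, "test"), (12, 13, "problem")}

p, r, f = precision_recall_f1(gold, predicted)
print(f"precision={p:.3f} recall={r:.3f} f1={f:.3f}")
```

Here two of the four predicted spans match the gold standard exactly, so precision is 2/4, recall is 2/3 and F1 is 4/7. Shared-task evaluations may also report relaxed (overlap-based) matching, which this sketch does not cover.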


Corresponding author

Correspondence to Ruchi Patel.

Copyright information

© 2019 Springer Nature Switzerland AG

About this chapter

Cite this chapter

Patel, R., Tanwani, S. (2019). Application of Machine Learning Techniques in Clinical Information Extraction. In: Mishra, M., Mishra, B., Patel, Y., Misra, R. (eds) Smart Techniques for a Smarter Planet. Studies in Fuzziness and Soft Computing, vol 374. Springer, Cham. https://doi.org/10.1007/978-3-030-03131-2_8
