Lung Cancer Concept Annotation from Spanish Clinical Narratives

  • Conference paper
Data Integration in the Life Sciences (DILS 2018)

Abstract

The recent rapid increase in the generation of clinical data, together with rapid advances in computational science, makes it possible to extract new insights from massive datasets in the healthcare industry. Oncological Electronic Health Records (EHRs) are creating rich databases that document patients’ histories, and they potentially contain many patterns that can help in better management of the disease. However, these patterns are locked within the free-text (unstructured) portions of EHRs, which limits the ability of health professionals to extract useful information from them and, ultimately, to perform the Query and Answering (Q&A) process accurately. The Information Extraction (IE) process requires Natural Language Processing (NLP) techniques to assign semantics to these patterns. Therefore, in this paper, we analyze the design of annotators for specific lung cancer concepts that can be integrated into the Apache Unstructured Information Management Architecture (UIMA) framework. In addition, we explain the details of the generation and storage of annotation outcomes.
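
To make the idea of a concept annotator concrete, the following is a minimal sketch in Python, not the authors' implementation and not the UIMA API (which is Java-based). It assumes that an annotation can be represented as a typed span with character offsets into the source text. The regular expressions, the concept labels (ECOG, STAGE, BIOMARKER) and the sample sentence are all hypothetical and chosen only for illustration.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Annotation:
    """A typed span over the source text (offsets are character positions)."""
    begin: int
    end: int
    concept: str
    covered_text: str


# Hypothetical patterns for a few lung cancer concepts in Spanish notes.
# They are illustrative only and far from clinically complete.
PATTERNS = {
    "ECOG": re.compile(r"\bECOG\s*[:=]?\s*([0-5])\b", re.IGNORECASE),
    "STAGE": re.compile(r"\bestadio\s+(IV|III[ABC]?|II[AB]?|I[AB]?)\b", re.IGNORECASE),
    "BIOMARKER": re.compile(r"\b(ALK|ROS1)\b"),
}


def annotate(text: str) -> list[Annotation]:
    """Run every pattern over the text and return annotations sorted by offset."""
    annotations = []
    for concept, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            annotations.append(
                Annotation(match.start(), match.end(), concept, match.group(0))
            )
    return sorted(annotations, key=lambda a: a.begin)


# Hypothetical sample sentence, not taken from any real record.
sample = "Paciente con adenocarcinoma de pulmón estadio IV, ECOG 1, ALK positivo."
for annotation in annotate(sample):
    print(annotation)
```

Keeping the offsets alongside the concept label means each annotation can be stored separately from the narrative and later mapped back to the exact text it covers.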

Author information

Correspondence to Alejandro Rodríguez-González.

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Cite this paper

Najafabadipour, M., Tuñas, J.M., Rodríguez-González, A., Menasalvas, E. (2019). Lung Cancer Concept Annotation from Spanish Clinical Narratives. In: Auer, S., Vidal, ME. (eds) Data Integration in the Life Sciences. DILS 2018. Lecture Notes in Computer Science, vol 11371. Springer, Cham. https://doi.org/10.1007/978-3-030-06016-9_15

  • DOI: https://doi.org/10.1007/978-3-030-06016-9_15

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-06015-2

  • Online ISBN: 978-3-030-06016-9

  • eBook Packages: Computer Science, Computer Science (R0)
