Use of Domain Knowledge in the Automatic Extraction of Structured Representations from Patient-Related Texts

  • Conference paper
Conceptual Structures: From Information to Intelligence (ICCS 2010)

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6208)

Abstract

Domain knowledge is an essential resource in Information Extraction (IE) from free text, since it supports decisions about structuring the extracted text objects into domain statements. Manually created conceptual structures thus enable the semantic representation of textual information. This paper discusses the role of domain knowledge in the extraction of structured data from patient-related texts. It shows that domain knowledge is encoded not only in the conceptual structures, which provide the ontological framework for the IE task, but also in the IE templates designed to capture domain semantics. A prototype system and IE examples of domain knowledge usage are presented, together with results of the current prototype evaluation.

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Angelova, G. (2010). Use of Domain Knowledge in the Automatic Extraction of Structured Representations from Patient-Related Texts. In: Croitoru, M., Ferré, S., Lukose, D. (eds) Conceptual Structures: From Information to Intelligence. ICCS 2010. Lecture Notes in Computer Science, vol 6208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14197-3_6

  • DOI: https://doi.org/10.1007/978-3-642-14197-3_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14196-6

  • Online ISBN: 978-3-642-14197-3

  • eBook Packages: Computer Science, Computer Science (R0)
