Use of Domain Knowledge in the Automatic Extraction of Structured Representations from Patient-Related Texts

Angelova, Galia

doi:10.1007/978-3-642-14197-3_6

Galia Angelova²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 6208))

Included in the following conference series:

International Conference on Conceptual Structures

696 Accesses
1 Citations

Abstract

Domain knowledge is essential resource in Information Extraction (IE) from free text since it supports the decisions about structuring the extracted text objects into domain statements. Thus manually-created conceptual structures enable the semantic representation of textual information. This paper discusses the role of domain knowledge in information extraction of structured data from patient-related texts. The article shows that domain knowledge is encoded not only in the conceptual structures, which provide the ontological framework for the IE task, but also in the IE templates that are designed to capture domain semantics. A prototype system and IE examples of domain knowledge usage are considered together with results of the current prototype evaluation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Spasic, I., Ananiadou, S., McNaught, J., Kumar, A.: Text mining and ontologies in biomedicine: Making sense of raw text. Briefings in Bioinformatics 6(3), 239–251 (2005)
Article Google Scholar
Grishman, R., Sundheim, B.: Message understanding conference - 6: A brief history. In: Proceedings of the 16th International Conference on Computational Linguistics COLING 1996, Copenhagen (July 1996)
Google Scholar
Cunningham, H.: Information Extraction, Automatic. In: Encyclopedia of Language and Linguistics. Elsevier, Amsterdam (2005), http://gate.ac.uk/sale/ell2/ie/main.pdf (last visited April 2010)
Google Scholar
Boytcheva, S., Nikolova, I., Paskaleva, E., Angelova, G., Tcharaktchiev, D., Dimitrova, N.: Extraction and Exploration of Correlations in Patient Status Data. In: Savova, G., Karkaletsis, V., Angelova, G. (eds.) Biomedical Information Extraction, Proceedings of the International Workshop Held in Conjunction with RANLP 2009, Borovets, Bulgaria, September 18, vol. 18, pp. 1–7 (2009)
Google Scholar
Bulgarian Drug Agency, http://www.bda.bg/index.php?lang=en (last visited April 2010)
Roberts, A., Gaizauskas, R., Hepple, M., Guo, Y.: Combining terminology resources and statistical methods for entity recognition: an evaluation. In: Proc. of the Sixth Int. Conf. on Language Resources and Evaluation (LREC 2008). CLEF Clinical E-Science Framework, University of Sheffield (2008), http://nlp.shef.ac.uk/clef/ (last visited April 2010)
Novichkova, S., Egorov, S., Daraselia, N.: MedScan, a NL processing engine for MEDLINE abstracts. Bioinformatics 19(13), 1699–1706 (2003)
Article Google Scholar
Daraselia, N., Yuryev, A., Egorov, S., Novichkova, S., Nikitin, A., Mazo, I.: Extracting human protein interactions from Medline using a full-sentence parser. Bioinformatics 20(5), 604–611 (2004)
Article Google Scholar
Gangemi, A., Pisanelli, D.M., Steve, G.: Understanding Systematic Conceptual Structures in Polysemous Medical Terms. In: Marc Overhage, J. (ed.) Proc. of AMIA An. Symposium on Converging Information, Technology and Health Care (2000)
Google Scholar
Denecke, K., Kohlhof, I., Bernauer, J.: Use of Multiaxial Indexing for IE from Medical Texts. In: Proc. FCTC 2006, Int. Workshop on Foundations of Clinical Terminologies and Classifications, Timisoara, Romania, ROMEDINF (April 2006)
Google Scholar
Lee, C.H., Khoo, C., Na, J.C.: Automatic identification of treatment relations for medical ontology learning: An exploratory study. In: McIlwaine, I.C. (ed.) Knowledge Organization and the Global Information Society: Proc. of the Eighth Int. ISKO Conference, pp. 245–250. Ergon Verlag, Wurzburg (2004)
Google Scholar
Zhang, Y., Patrick, J.: Extracting Semantics in a Clinical Scenario. In: Roddick, J.F., Warren, J.R. (eds.) Proc. Australasian Workshop on Health Knowledge Management and Discovery (HKMD 2007), CRPIT, Ballarat, Australia, ACS, vol. 68, pp. 241–247 (2007)
Google Scholar
Boytcheva, S., Nikolova, I., Paskaleva, E., Angelova, G., Tcharaktchiev, D., Dimitrova, N.: Structuring of Status Descriptions in Hospital Patient Records. In: The Proc. 2nd Int. Workshop on Building and Evaluating Resources for BioMedical Text Mining, associated to the 7th Int. Conf. on Language Resources and Evaluation (LREC 2010), Malta (to appear) (May 2010)
Google Scholar
Sowa, J.: Conceptual Information Processing in Mind and Machines. Reading, MA (1984)
Google Scholar
Boytcheva, S., Strupchanska, A., Paskaleva, E., Tcharaktchiev, D.: Some Aspects of Negation Processing in Electronic Health Records. In: Proc. of International Workshop Language and Speech Infrastructure for Information Access in the Balkan Countries, Borovets, Bulgaria, pp. 1–8 (2005)
Google Scholar
BioPortal, http://bioportal.bioontology.org/visualize/13578/Diabetes_Mellitus (last visited April 2010)
Boytcheva, S., Angelova, G.: Towards Extraction of Conceptual Structures from Electronic Health Records. In: Rudolph, S., Dau, F., Kuznetsov, S.O. (eds.) Conceptual Structures: Leveraging Semantic Technologies. LNCS (LNAI), vol. 5662, pp. 100–113. Springer, Heidelberg (2009)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Institute for Parallel Processing, Bulgarian Academy of Sciences, 25A Acad. G. Bonchev Str., 1113, Sofia, Bulgaria
Galia Angelova

Authors

Galia Angelova
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

LIRMM, Université Montpellier,
Madalina Croitoru
Irisa/Ifsic, Université de Rennes 1, Campus Universitaire de Beaulieu, 35042, Rennes cedex, France
Sébastien Ferré
MIMOS BERHAD, Kuala Lumpur, MALAYSIA
Dickson Lukose

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Angelova, G. (2010). Use of Domain Knowledge in the Automatic Extraction of Structured Representations from Patient-Related Texts. In: Croitoru, M., Ferré, S., Lukose, D. (eds) Conceptual Structures: From Information to Intelligence. ICCS 2010. Lecture Notes in Computer Science(), vol 6208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14197-3_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-14197-3_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14196-6
Online ISBN: 978-3-642-14197-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics