Abstract
Domain knowledge is essential resource in Information Extraction (IE) from free text since it supports the decisions about structuring the extracted text objects into domain statements. Thus manually-created conceptual structures enable the semantic representation of textual information. This paper discusses the role of domain knowledge in information extraction of structured data from patient-related texts. The article shows that domain knowledge is encoded not only in the conceptual structures, which provide the ontological framework for the IE task, but also in the IE templates that are designed to capture domain semantics. A prototype system and IE examples of domain knowledge usage are considered together with results of the current prototype evaluation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Spasic, I., Ananiadou, S., McNaught, J., Kumar, A.: Text mining and ontologies in biomedicine: Making sense of raw text. Briefings in Bioinformatics 6(3), 239–251 (2005)
Grishman, R., Sundheim, B.: Message understanding conference - 6: A brief history. In: Proceedings of the 16th International Conference on Computational Linguistics COLING 1996, Copenhagen (July 1996)
Cunningham, H.: Information Extraction, Automatic. In: Encyclopedia of Language and Linguistics. Elsevier, Amsterdam (2005), http://gate.ac.uk/sale/ell2/ie/main.pdf (last visited April 2010)
Boytcheva, S., Nikolova, I., Paskaleva, E., Angelova, G., Tcharaktchiev, D., Dimitrova, N.: Extraction and Exploration of Correlations in Patient Status Data. In: Savova, G., Karkaletsis, V., Angelova, G. (eds.) Biomedical Information Extraction, Proceedings of the International Workshop Held in Conjunction with RANLP 2009, Borovets, Bulgaria, September 18, vol. 18, pp. 1–7 (2009)
Bulgarian Drug Agency, http://www.bda.bg/index.php?lang=en (last visited April 2010)
Roberts, A., Gaizauskas, R., Hepple, M., Guo, Y.: Combining terminology resources and statistical methods for entity recognition: an evaluation. In: Proc. of the Sixth Int. Conf. on Language Resources and Evaluation (LREC 2008). CLEF Clinical E-Science Framework, University of Sheffield (2008), http://nlp.shef.ac.uk/clef/ (last visited April 2010)
Novichkova, S., Egorov, S., Daraselia, N.: MedScan, a NL processing engine for MEDLINE abstracts. Bioinformatics 19(13), 1699–1706 (2003)
Daraselia, N., Yuryev, A., Egorov, S., Novichkova, S., Nikitin, A., Mazo, I.: Extracting human protein interactions from Medline using a full-sentence parser. Bioinformatics 20(5), 604–611 (2004)
Gangemi, A., Pisanelli, D.M., Steve, G.: Understanding Systematic Conceptual Structures in Polysemous Medical Terms. In: Marc Overhage, J. (ed.) Proc. of AMIA An. Symposium on Converging Information, Technology and Health Care (2000)
Denecke, K., Kohlhof, I., Bernauer, J.: Use of Multiaxial Indexing for IE from Medical Texts. In: Proc. FCTC 2006, Int. Workshop on Foundations of Clinical Terminologies and Classifications, Timisoara, Romania, ROMEDINF (April 2006)
Lee, C.H., Khoo, C., Na, J.C.: Automatic identification of treatment relations for medical ontology learning: An exploratory study. In: McIlwaine, I.C. (ed.) Knowledge Organization and the Global Information Society: Proc. of the Eighth Int. ISKO Conference, pp. 245–250. Ergon Verlag, Wurzburg (2004)
Zhang, Y., Patrick, J.: Extracting Semantics in a Clinical Scenario. In: Roddick, J.F., Warren, J.R. (eds.) Proc. Australasian Workshop on Health Knowledge Management and Discovery (HKMD 2007), CRPIT, Ballarat, Australia, ACS, vol. 68, pp. 241–247 (2007)
Boytcheva, S., Nikolova, I., Paskaleva, E., Angelova, G., Tcharaktchiev, D., Dimitrova, N.: Structuring of Status Descriptions in Hospital Patient Records. In: The Proc. 2nd Int. Workshop on Building and Evaluating Resources for BioMedical Text Mining, associated to the 7th Int. Conf. on Language Resources and Evaluation (LREC 2010), Malta (to appear) (May 2010)
Sowa, J.: Conceptual Information Processing in Mind and Machines. Reading, MA (1984)
Boytcheva, S., Strupchanska, A., Paskaleva, E., Tcharaktchiev, D.: Some Aspects of Negation Processing in Electronic Health Records. In: Proc. of International Workshop Language and Speech Infrastructure for Information Access in the Balkan Countries, Borovets, Bulgaria, pp. 1–8 (2005)
BioPortal, http://bioportal.bioontology.org/visualize/13578/Diabetes_Mellitus (last visited April 2010)
Boytcheva, S., Angelova, G.: Towards Extraction of Conceptual Structures from Electronic Health Records. In: Rudolph, S., Dau, F., Kuznetsov, S.O. (eds.) Conceptual Structures: Leveraging Semantic Technologies. LNCS (LNAI), vol. 5662, pp. 100–113. Springer, Heidelberg (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Angelova, G. (2010). Use of Domain Knowledge in the Automatic Extraction of Structured Representations from Patient-Related Texts. In: Croitoru, M., Ferré, S., Lukose, D. (eds) Conceptual Structures: From Information to Intelligence. ICCS 2010. Lecture Notes in Computer Science(), vol 6208. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14197-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-14197-3_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14196-6
Online ISBN: 978-3-642-14197-3
eBook Packages: Computer ScienceComputer Science (R0)