Automatic Annotation of Medical Records in Spanish with Disease, Drug and Substance Names

Oronoz, Maite; Casillas, Arantza; Gojenola, Koldo; Perez, Alicia

doi:10.1007/978-3-642-41827-3_67

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 8259)

Included in the following conference series:

Iberoamerican Congress on Pattern Recognition

Abstract

This paper presents an annotation tool that detects entities in the biomedical domain. By enriching the lexica of the Freeling analyzer with biomedical terms extracted from dictionaries and ontologies such as SNOMED CT, the system can automatically detect medical terms in texts. The system was evaluated against a manually tagged corpus, focusing on entities that refer to pharmaceutical drug names, substances and diseases. The results suggest that a good annotation tool would support subsequent processes such as data mining or pattern recognition tasks in the biomedical domain.
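
To make the idea in the abstract concrete, the following is a minimal sketch of lexicon-based entity annotation with a simple exact-match evaluation. It is not the authors' implementation and does not use the Freeling API: the lexicon entries, the entity labels and the function names `annotate` and `precision_recall` are all hypothetical, chosen only for illustration. The sketch assumes a greedy longest-match lookup over tokenized text, which is one common way to apply a term dictionary.

```python
from typing import Dict, List, Tuple

Span = Tuple[int, int, str]  # (start token, end token exclusive, entity type)

# Hypothetical toy lexicon: lower-cased token tuples mapped to an entity type.
# A real system would load such entries from dictionaries or ontologies.
LEXICON: Dict[Tuple[str, ...], str] = {
    ("diabetes", "mellitus"): "DISEASE",
    ("diabetes",): "DISEASE",
    ("paracetamol",): "DRUG",
    ("glucosa",): "SUBSTANCE",
}
MAX_LEN = max(len(key) for key in LEXICON)


def annotate(tokens: List[str]) -> List[Span]:
    """Tag tokens by greedy longest-match lookup in LEXICON."""
    spans: List[Span] = []
    i = 0
    while i < len(tokens):
        matched = False
        # Try the longest candidate first so multiword terms win.
        for n in range(min(MAX_LEN, len(tokens) - i), 0, -1):
            key = tuple(t.lower() for t in tokens[i:i + n])
            if key in LEXICON:
                spans.append((i, i + n, LEXICON[key]))
                i += n
                matched = True
                break
        if not matched:
            i += 1
    return spans


def precision_recall(predicted: List[Span], gold: List[Span]) -> Tuple[float, float]:
    """Exact-match precision and recall against manually tagged spans."""
    true_positives = len(set(predicted) & set(gold))
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


tokens = "Paciente con diabetes mellitus tratado con paracetamol".split()
predicted = annotate(tokens)
```

Trying the longest candidate first means that "diabetes mellitus" is tagged as one entity instead of stopping at "diabetes". The evaluation counts a predicted span as correct only when its boundaries and type both match a gold span exactly; other matching criteria are possible.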

Author information

Authors and Affiliations

Departamento de Lenguajes y Sistemas Informáticos. IXA taldea, UPV-EHU, Spain
Maite Oronoz, Koldo Gojenola & Alicia Perez
Departamento de Electricidad y Electrónica. IXA taldea, UPV-EHU, Spain
Arantza Casillas

Editor information

Editors and Affiliations

Advanced Technologies Application Center (CENATAV), 7a A#21406 esq. 214 y 216, Rpto. Siboney, Playa, C.P. 12200, La Habana, Cuba
José Ruiz-Shulcloper
Institute of Cybernetics “E. Caianiello”, National Research Council (CNR), Via Campi Flegrei 34, 80078, Pozzuoli, Naples, Italy
Gabriella Sanniti di Baja

About this paper

Cite this paper

Oronoz, M., Casillas, A., Gojenola, K., Perez, A. (2013). Automatic Annotation of Medical Records in Spanish with Disease, Drug and Substance Names. In: Ruiz-Shulcloper, J., Sanniti di Baja, G. (eds) Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. CIARP 2013. Lecture Notes in Computer Science, vol 8259. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41827-3_67

DOI: https://doi.org/10.1007/978-3-642-41827-3_67
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41826-6
Online ISBN: 978-3-642-41827-3
eBook Packages: Computer Science, Computer Science (R0)
