Abstract
Information about medications is critical in supporting decision-making during the prescription process and thus in improving the safety and quality of care. The Summary of Product Characteristics (SPC) represents the basis of information for health professionals on how to use medicines. However, this information is locked in free-text and, as such, cannot be actively accessed and elaborated by computerized applications. In this work, we propose a machine learning based system for the automatic recognition of drug-related entities (active ingredient, interaction effects, etc.) in SPCs, focusing on drug interactions. Our approach learns to classify this information in a structured prediction framework, relying on conditional random fields. The classifier is trained and evaluated using a corpus of a hundred SPCs. They have been hand-annotated with thirteen semantic labels that have been derived from a previously developed domain ontology. Our evaluations show that the model exhibits high overall performance, with an average F1-measure of about 90%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Evans, D.A., Brownlowt, N.D., Hersh, W.R., Campbell, E.M.: Automating concept identification in the electronic medical record: An experiment in extracting dosage information. In: Proc. AMIA Annu. Fall Symp., pp. 388–392 (1996)
Friedman, C., Jonhson, S.B., Forman, B., Stanner, J.: Architectural requirements for a multipurpose natural language processor in the clinical environment. In: Proc. Annu. Symp. Comput. Appl. Med. Care, pp. 347–351 (1995)
Gold, S., Elhadad, N., Zhu, X., Cimino, J.J., Hripcsak, G.: Extracting structured medication event information from discharge summaries. In: Proc. AMIA Annu. Symp., pp. 237–241 (2008)
Haug, P.J., Ranum, D.L., Frederick, P.R.: Computerized extraction of coded findings from free-text radiologic reports. Radiology 174(2), 543–548 (1990)
Hripcsak, G., Kuperman, G.J., Friedman, C.: Extracting findings from narrative reports: software transferability and sources of physician disagreement. Methods Inf. Med., 1–7 (1998)
Lafferty, J.D., McCallum, A., Pereira, F.C.N.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. International Conference on Machine Learning, vol. 18, pp. 282–289 (2001)
Leape, L.L., Bates, D.W., Cullen, D.J., Cuper, J.: System analysis of adverse drug events. Journal of the American Medical Association 274(1), 35–43 (1995)
Levin, M.A., Krol, M., Doshi, A.M., Reich, D.L.: Extraction and mapping of drug names from free text to a standardized nomenclature. In: Proc. AMIA Annu. Symp., pp. 438–442 (2007)
McCallum, A.: Mallet: A machine learning for language toolkit. Tech. rep. (2002), http://mallet.cs.umass.edu
McCarry, A.T., Sponsler, J.L., Brylawski, B., Browne, A.C.: The role of a lexical knowledge in biomedical text understanding. In: Proc. SCAMC, pp. 103–104 (1987)
Meystre, S.M., Savova, G.K., Kipper-Schuler, K.C., Hurdle, J.F.: Extracting information from textual documents in the electronic health record: A review of recent research. In: IMIA Yearbook of Medical Informatics, pp. 128–144 (2008)
Pereira, S., Plaisantin, B., Korchia, M., Rozanes, N., Serrot, E., Joubert, M., Darmoni, S.J.: Automatic construction of dictionaries, application to product characteristics indexing. In: Proc Workshop on Advances in Bio Text Mining (2010)
Rubrichi, S., Leonardi, G., Quaglini, S.: A drug ontology as a basis for safe therapeutic decision. In: Proc. Second National Conference of Bioengineering, Patron (2010)
Sager, N., Friedman, C., Chi, E.: The analysis and processing of clinical narrative. In: Salamon, R., Blum, B., Jorgensen, M. (eds.) Proc. fifth Conference on Medical Informatics, pp. 1101–1105. Elsevier, Amsterdam (1986)
Segura-Bedmar, I., Martínez, P., de Pablo-Sánchez, C.: Extracting drug-drug interactions from biomedical texts. In: Workshop on Advances in Bio Text Mining (2010)
Shah, A.D., Martinez, C.: An algorithm to derive a numerical daily dose from unstructured text dosage instructions. Pharmacoepidemiology and Drug Safety 15, 161–166 (2006)
Sirohi, E., Peissig, P.: Study of effect of drug lexicons on medication extraction from electronic medical records. In: Proc. Pacific Symposium on Biocomputing, vol. 10, pp. 308–318 (2005)
Van Rijsbergen, C.J.: Information Retrieval. Department of Computer Science, University of Glasgow (1979)
Xu, H., Stenner, S.P., Doan, S., Johnson, K.B., Waitman, L.R., Denny, J.C.: Medex: a medication information extraction system for clinical narratives. Journal of the American Medical Informatics Association 17, 19–24 (2010)
Zanchetta, E., Baroni, M.: Morph-it!: a free corpus-based morphological resource for the italian language. In: Proc. Corpus Linguistics 2005 Conference (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rubrichi, S., Quaglini, S., Spengler, A., Gallinari, P. (2011). Extracting Information from Summary of Product Characteristics for Improving Drugs Prescription Safety. In: Peleg, M., Lavrač, N., Combi, C. (eds) Artificial Intelligence in Medicine. AIME 2011. Lecture Notes in Computer Science(), vol 6747. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22218-4_42
Download citation
DOI: https://doi.org/10.1007/978-3-642-22218-4_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22217-7
Online ISBN: 978-3-642-22218-4
eBook Packages: Computer ScienceComputer Science (R0)