Abstract
Medical words exhibit a rich and productive morphology. Morphological knowledge is therefore very important for any medical language processing application. We propose a simple and powerful method to acquire such knowledge automatically. It takes advantage of commonly available lists of synonym terms to bootstrap the acquisition process. We tested it on the French version of the SNOMED International Microglossary for pathology. The families of morphologically related words that we obtained were useful for query expansion in a coding assistant. Since the method does not rely on a priori linguistic knowledge, it is applicable to other languages such as English.
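The abstract does not spell out how synonym lists bootstrap the acquisition, so the following is only a minimal sketch of one plausible reading: words drawn from two synonymous terms are treated as morphologically related when they share a long enough initial string, and related words are then grouped into families. The threshold `MIN_PREFIX`, all function names, and the English toy data are assumptions made for illustration; none of them come from the paper.

```python
from collections import defaultdict
from itertools import product

MIN_PREFIX = 3  # assumed threshold, not taken from the paper


def common_prefix_len(a, b):
    """Length of the longest initial string shared by a and b."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return n


def related_pairs(synonym_pairs, min_prefix=MIN_PREFIX):
    """Collect word pairs from synonymous terms that share a long prefix."""
    pairs = set()
    for term_a, term_b in synonym_pairs:
        words_a = term_a.lower().split()
        words_b = term_b.lower().split()
        for w1, w2 in product(words_a, words_b):
            if w1 != w2 and common_prefix_len(w1, w2) >= min_prefix:
                pairs.add(tuple(sorted((w1, w2))))
    return pairs


def suffix_rules(pairs):
    """Count the suffix alternations observed in the related pairs."""
    rules = defaultdict(int)
    for w1, w2 in pairs:
        k = common_prefix_len(w1, w2)
        rules[(w1[k:], w2[k:])] += 1
    return dict(rules)


def families(pairs):
    """Group related words into families (connected components)."""
    parent = {}

    def find(w):
        parent.setdefault(w, w)
        while parent[w] != w:
            parent[w] = parent[parent[w]]  # path compression
            w = parent[w]
        return w

    for w1, w2 in pairs:
        parent[find(w1)] = find(w2)
    groups = defaultdict(set)
    for w in parent:
        groups[find(w)].add(w)
    return list(groups.values())


# Invented toy synonym pairs, for illustration only.
toy_synonyms = [
    ("aortic stenosis", "stenosis of aorta"),
    ("aortitis", "aortic inflammation"),
]
toy_pairs = related_pairs(toy_synonyms)
toy_families = families(toy_pairs)
```

In a query-expansion setting such as the coding assistant mentioned above, a query word could then be replaced by every member of its family before lookup. A real system would also need to filter spurious pairs that merely happen to share a prefix.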
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zweigenbaum, P., Grabar, N. (1999). Automatic Acquisition of Morphological Knowledge for Medical Language Processing. In: Horn, W., Shahar, Y., Lindberg, G., Andreassen, S., Wyatt, J. (eds) Artificial Intelligence in Medicine. AIMDM 1999. Lecture Notes in Computer Science, vol 1620. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48720-4_46
DOI: https://doi.org/10.1007/3-540-48720-4_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66162-7
Online ISBN: 978-3-540-48720-3
eBook Packages: Springer Book Archive