Skip to main content

Reusing Translated Terms to Expand a Multilingual Thesaurus

  • Conference paper
  • First Online:
Machine Translation and the Information Soup (AMTA 1998)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 1529))

Included in the following conference series:

  • 681 Accesses

Abstract

Multilingual thesauri play a key role in multilingual text retrieval. At present, only a small number of on-line thesauri contain translations of terms in languages other than English. This is the case of the Unified Medical Language System (UMLS) Metathesaurus that includes the same term in different languages (e.g., English and Spanish). However, only a subset of terms in English have a corresponding translation in Spanish. In this work, I present an approach and some experimental results for reusing translated terms to expand the Metathesaurus. The approach includes two main tasks: finding patterns and formulating rules to automate the translation of English terms into Spanish terms. The approach is based on pattern matching, morphological rules, and word order inversion.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Soergel, D.: Multilingual thesauri in cross-language text and speech retrieval. AAAI Symposium on Cross-Language Text and Speech Retrieval (1997)

    Google Scholar 

  2. Oard, D.W.: Alternative Approaches for Cross-Language Text Retrieval. AAAI Symposium on Cross-Language Text and Speech Retrieval (1997)

    Google Scholar 

  3. Sheridan, P., Ballerini, J.P. Experiments in Multilingual Information Retrieval using the SPIDER system. Proceedings of SIGIR-96 (1996)

    Google Scholar 

  4. Yang, Y., Brown, R., Frederking, R., Carbonell, J., Geng, Y., Lee, D. Bilingual-corpus Based Approaches to Translingual Information Retrieval. The 2nd Workshop on Multilinguality in Software Industry: The AI Contribution (1997)

    Google Scholar 

  5. Ballesteros, L., Croft, W.B. Phrasal translation and query expansion techniques for cross-language information retrieval. AAAI Symposium on Cross-Language Text and Speech Retrieval (1997)

    Google Scholar 

  6. Stamatos, E., Michos, S., Pantelodimou, C., Fakotakis, N. TRANSLIB: An Advanced Tool for Supporting Multilingual Access to Library Catalogues. The 2nd Workshop on Multilinguality in Software Industry: The AI Contribution (1997)

    Google Scholar 

  7. Gilarranz, J., Gonzalo, J., Verdejo, F. Language-independent Text Retrieval with the EuroWordNet Multilingual Semantic Database. The 2nd Workshop on Multi-linguality in Software Industry: The AI Contribution (1997)

    Google Scholar 

  8. Nkwenti-Azeh, B. New trends in terminology processing and implications for practical translation. ASLIB (1994) 67–74

    Google Scholar 

  9. National Library of Medicine. Unified Medical Language System. 8th Edition 1997. UMLS Knowledge Sources, Documentation (1997)

    Google Scholar 

  10. Klemettinen, M., Mannila, H., Toivonen, H. A Data Mining Methodology and Its Application to Semi-Automatic Knowledge Acquisition. DEXA’97 Workshop (1997)

    Google Scholar 

  11. Fayyad, U., Piatetsky-Shapiro, G., Smyth, P. From data mining to knowledge discovery: An overview. Advances in Knowledge Discovery and Data Mining (1996) 1–34

    Google Scholar 

  12. Brazma, A., Jonassen, I., Eidhammer, I., Gilbert, D. R. Approaches to automatic discovery of patterns in biosequences. Reports in Informatics 113 (1995)

    Google Scholar 

  13. Jonassen, I., Collins, J. F., Higgins, D. G. Finding flexible patterns in unaligned protein sequences. Protein Science 4(8) (1995) 1587–1595

    Article  Google Scholar 

  14. Jonassen, I. Efficient discovery of conserved patterns using a pattern graph CABIOS (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1998 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Guillén, R. (1998). Reusing Translated Terms to Expand a Multilingual Thesaurus. In: Farwell, D., Gerber, L., Hovy, E. (eds) Machine Translation and the Information Soup. AMTA 1998. Lecture Notes in Computer Science(), vol 1529. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-49478-2_34

Download citation

  • DOI: https://doi.org/10.1007/3-540-49478-2_34

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65259-5

  • Online ISBN: 978-3-540-49478-2

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics