A Simple Approach to the Spanish-English Bilingual Retrieval Task

  • Conference paper
Cross-Language Information Retrieval and Evaluation (CLEF 2000)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2069)

Abstract

This paper describes our participation in the CLEF bilingual retrieval task (formulating queries in Spanish to retrieve documents in English), using an information retrieval (IR) system based on the vector model. Our aim was to apply a simple approach to the problem, without expecting great results, especially given the short time available. The Spanish queries were translated into English by a commercial machine translation system. Stop words were then removed from the translations, and the remaining terms were stemmed with a standard stemmer. Results were poorer than those obtained through monolingual retrieval with the original English queries, the difference being slightly over 15%.
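The processing chain described in the abstract (translated query, stop-word removal, stemming, vector-model matching) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' system: the machine translation step is assumed to have already produced an English query, the stop-word list is a tiny hypothetical one, `toy_stem` is a crude suffix stripper standing in for the unnamed standard stemmer, and TF-IDF weighting with cosine similarity is one common form of the vector model that the abstract does not confirm.

```python
import math
import re
from collections import Counter

# Hypothetical, tiny stop-word list; the abstract does not say which list was used.
STOP_WORDS = {"a", "an", "and", "are", "by", "for", "in", "is", "of", "on", "the", "to"}


def toy_stem(word):
    """Crude suffix stripper used as a stand-in for a standard stemmer."""
    for suffix in ("ing", "ies", "es", "ed", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def preprocess(text):
    """Lowercase, tokenize, drop stop words, then stem the remaining terms."""
    tokens = re.findall(r"[a-z]+", text.lower())
    return [toy_stem(t) for t in tokens if t not in STOP_WORDS]


def tfidf_vectors(token_lists):
    """Build sparse TF-IDF vectors (dicts) and return them with the IDF table."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    idf = {term: math.log(n_docs / df) + 1.0 for term, df in doc_freq.items()}
    vectors = [
        {term: tf * idf[term] for term, tf in Counter(tokens).items()}
        for tokens in token_lists
    ]
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank(translated_query, documents):
    """Return (score, document index) pairs, best match first.

    `translated_query` is assumed to be the English output of the
    machine translation step, which is not modelled here.
    """
    doc_vectors, idf = tfidf_vectors([preprocess(d) for d in documents])
    query_counts = Counter(preprocess(translated_query))
    query_vector = {t: tf * idf[t] for t, tf in query_counts.items() if t in idf}
    scores = [(cosine(query_vector, dv), i) for i, dv in enumerate(doc_vectors)]
    return sorted(scores, key=lambda pair: (-pair[0], pair[1]))


if __name__ == "__main__":
    docs = [
        "Pesticides in baby food are regulated by the European Union.",
        "The football championship was held in Madrid.",
        "New rules on pesticides and food safety.",
    ]
    for score, index in rank("pesticides in baby foods", docs):
        print(f"{score:.3f}  {docs[index]}")
```

Because the query passes through the same stop-word filter and stemmer as the documents, surface differences introduced by translation (for example "foods" versus "food") collapse to the same index term before matching.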

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Figuerola, C.G., Alonso Berrocal, J.L., Zazo, A.F., Gómez Díaz, R. (2001). A Simple Approach to the Spanish-English Bilingual Retrieval Task. In: Peters, C. (ed.) Cross-Language Information Retrieval and Evaluation. CLEF 2000. Lecture Notes in Computer Science, vol 2069. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44645-1_21

  • DOI: https://doi.org/10.1007/3-540-44645-1_21

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-42446-8

  • Online ISBN: 978-3-540-44645-3

  • eBook Packages: Springer Book Archive
