Abstract
This paper describes our participation in the CLEF bilingual retrieval task (formulating queries in Spanish to retrieve documents in English), using an information retrieval (IR) system based on the vector model. Our aim was to use a simple approach to solve the problem, without expecting to obtain great results, especially owing to the short time available. The queries formulated in Spanish were translated to English by a commercial machine translation system. The translations were filtered to eliminate stop words, and then the remaining terms were stemmed using a standard stemmer. Results were poorer than those obtained through monolingual retrieval with original English queries, the difference being slightly over 15%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, E., Atserias, J., Padró, L. and Rigau, G.: Combining Supervised and Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation. Computers and the Humanities. Special Double Issue on SensEval. Eds. Martha Palmer and Adam Kilgarriff. 34:1,2, (2000). [http://www.lsi.upc.es/~nlp/papers/chum99-arpa.ps.gz]
Figuerola, C.G., Alonso Berrocal, J.L. and Zazo, A.F.: Diseño de un motor de recuperación para uso experimental y educativo. BiD: textos universitaris de biblioteconomia i documentació, 4 (2000) [http://www.ub.es/biblio/bid/04figue2.html]
Fluhr, C.: Multilingual Information Retrieval. In Cole, R. A. et al.: Survey of the State of the Art in Human Language Technology, Standford University, Stanford, CA,(1995) 391–305 [http://www.cse.ogi.edu/CSLU/HLTsurvey/ch8node7.html]
Hull, D.A. and Grefenstette, G.: Queryng Across Languages: A Dictionary-Based Approach to Multilingual Intormation Retrieval. SIGIR96 (1996) 49–57
Kowalski, G.: Information Retrieval Systems-Theory and implementation. Kluwer Academic Publishers (1997)
Oard, D. and Dorr, B.J.: A Survey of Multilingual Text Retrieval. (1996) [http://www.clis.umd.edu/dlrg/filter/papers/mlir.ps]
Phillips, I.: Porter’s stemming algorithm. Perl script. http://www.perl.com/CPAN-local/authors/Ian_Phillipps/Stem-0.1.tar.gz
Porter, M.F.: An algorithm for suffix stripping. Program, 14(3) (1980) 130–137
Systran Software: SYSTRAN-Translation Technologies, Language Translator, Online dictionary, Translate English (2000) [http://www.systransoft.com]
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Figuerola, C.G., Alonso Berrocal, J.L., Zazo, A.F., Gómez Díaz, R. (2001). A Simple Approach to the Spanish-English Bilingual Retrieval Task. In: Peters, C. (eds) Cross-Language Information Retrieval and Evaluation. CLEF 2000. Lecture Notes in Computer Science, vol 2069. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44645-1_21
Download citation
DOI: https://doi.org/10.1007/3-540-44645-1_21
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42446-8
Online ISBN: 978-3-540-44645-3
eBook Packages: Springer Book Archive