ITC-irst at CLEF 2003: Monolingual, Bilingual, and Multilingual Information Retrieval
This paper reports on the participation of ITC-irst in the 2003 campaign of the Cross Language Evaluation Forum for the monolingual, bilingual, and small multilingual tasks. The languages considered were English, French, German, Italian, and Spanish. With respect to the ITC-irst system presented at CLEF 2002, the statistical models for bilingual document retrieval have been improved, more languages have been considered, and a novel multilingual information retrieval system has been developed, which combines several bilingual retrieval models into a statistical framework. As in the last CLEF, bilingual models integrate retrieval and translation scores over the set of N-best translations of the source query.
KeywordsTarget Language Retrieval Performance Source Language Bilingual Dictionary Compound Noun
Unable to display preview. Download preview PDF.
- 2.Federico, M., Bertoldi, N.: Statistical cross-language information retrieval using n-best query translations. In: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Tampere, Finland, pp. 167–174 (2002)Google Scholar
- 5.Rabiner, L.R.: A tutorial on hidden Markov models and selected applications in speech recognition. In: Weibel, A., Lee, K. (eds.) Readings in Speech Recognition, pp. 267–296. Morgan Kaufmann, San Francisco (1990)Google Scholar
- 7.Frakes, W.B., Baeza-Yates, R. (eds.): Information Retrieval: Data Structures and Algorithms. Prentice Hall, Englewood Cliffs (1992)Google Scholar
- 8.Johnson, S., Jourlin, P., Jones, K.S., Woodland, P.: Spoken document retrieval for TREC-8 at Cambridge University. In: Proceedings of the 8th Text REtrieval Conference, Gaithersburg, MD (1999)Google Scholar