SINAI at CLEF 2003: Decompounding and Merging

  • Fernando Martínez-Santiago
  • Arturo Montejo-Ráez
  • Luis Alfonso Ureña-López
  • M. Carlos Díaz-Galiano
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3237)

Abstract

This paper describes the application of the two-step RSV and mixed two-step RSV merging methods in the multilingual-4 and multilingual-8 tasks at CLEF 2003. We study the performance of these methods compared to previous studies and approaches. A new strategy for dealing with compound words which uses predefined vocabularies for automatic decomposition is also presented and evaluated.
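As a rough illustration of what vocabulary-driven decompounding can look like, the sketch below splits a compound into parts that all appear in a predefined word list, preferring longer parts and backtracking when a choice leads to a dead end. This is not the authors' algorithm, which the abstract does not detail; the function name `decompound`, the `min_len` parameter, and the toy German vocabulary are assumptions made for this example.

```python
def decompound(word, vocabulary, min_len=3):
    """Split `word` into vocabulary entries; return [word] if no full split exists.

    Hypothetical helper for illustration only. `vocabulary` is a set of
    lowercase words and `min_len` is the shortest part that is accepted.
    """
    word = word.lower()

    def split(rest):
        # An empty remainder means everything before it was matched.
        if not rest:
            return []
        # Try the longest candidate prefix first, then shorter ones.
        for end in range(len(rest), min_len - 1, -1):
            head = rest[:end]
            if head in vocabulary:
                tail = split(rest[end:])
                if tail is not None:
                    return [head] + tail
        # No prefix leads to a complete split of `rest`.
        return None

    parts = split(word)
    # Fall back to the unsplit word when decomposition fails.
    return parts if parts else [word]


# Toy vocabulary, assumed for the example.
vocab = {"welt", "meister", "schaft", "meisterschaft"}
print(decompound("Weltmeisterschaft", vocab))
```

Because the longest prefix is tried first, a word that is itself in the vocabulary is returned unsplit, and a word with no complete decomposition is left unchanged so that it can still be indexed as a single term.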

Keywords

Machine Translation; Query Term; Document Frequency; Compound Word; Original Query
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2004

Authors and Affiliations

  • Fernando Martínez-Santiago (1)
  • Arturo Montejo-Ráez (2)
  • Luis Alfonso Ureña-López (1)
  • M. Carlos Díaz-Galiano (1)
  1. Dpto. Computer Science, University of Jaén, Jaén, Spain
  2. Scientific Information Service, European Organization for Nuclear Research, Geneva, Switzerland
