Skip to main content

Report on CLEF 2002 Experiments: Combining Multiple Sources of Evidence

  • Conference paper
Advances in Cross-Language Information Retrieval (CLEF 2002)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2785))

Included in the following conference series:

Abstract

In our second participation in the CLEF retrieval tasks, our first objective was to propose better and more general stopword lists for various European languages (namely, French, Italian, German, Spanish and Finnish) along with improved, simpler and efficient stemming procedures. Our second goal was to propose a combined query-translation approach that could cross language barriers and also an effective merging strategy based on logistic regression for accessing the multilingual collection. Finally, within the Amaryllis experiment, we wanted to analyze how a specialized thesaurus might improve retrieval effectiveness.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Savoy, J.: Report on CLEF-2001 Experiments: Effective Combined Query-Translation Approach. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 27–43 66, 69

    Google Scholar 

  2. Robertson, S.E., Walker, S., Beaulieu, M.: Experimentation as a Way of Life: Okapi at TREC. Information Processing & Management 36 (2000) 95–108 69, 73

    Article  Google Scholar 

  3. Fox, C.: A Stop List for General Text. ACM-SIGIR Forum 24 (1999) 19–35 69

    Article  Google Scholar 

  4. Savoy, J.: A Stemming Procedure and Stopword List for General French Corpora. Journal of the American Society for Information Science 50 (1999) 944–952 69

    Article  Google Scholar 

  5. Sproat, R.: Morphology and Computation. The MIT Press, Cambridge (1992) 69

    Google Scholar 

  6. Lovins, J.B.: Development of a Stemming Algorithm. Mechanical Translation and Computational Linguistics 11 (1968) 22–31 69

    Google Scholar 

  7. Porter, M. F.: An Algorithm for Suffix Stripping. Program 14 (1980) 130–137 69

    Article  Google Scholar 

  8. Figuerola, C.G., Gómez, R., Zazo Rodríguez, A. F., Berrocal, J.L.A.: Spanish Monolingual Track: The Impact of Stemming on Retrieval. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 253–261 69

    Google Scholar 

  9. Kraaij, W., Pohlmann, R.: Viewing Stemming as Recall Enhancement. In Proceedings of the ACM-SIGIR 1996. The ACM Press, New York (1995) 40–48 70

    Google Scholar 

  10. Monz, C., de Rijke, M.: Shallow Morphological Analysis in Monolingual Information Retrieval for Dutch, German, and Italian. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 262–277 70

    Google Scholar 

  11. Chen, A.: Multilingual Information Retrieval Using English and Chinese Queries. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 44–58 70

    Google Scholar 

  12. Molina-Salgado, H., Moulinier, I., Knutson, M., Lund, E., Sekhon, K.: Thomson Legal and Regulatory at CLEF 2001: Monolingual and Bilingual Experiments. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 226–234 70

    Google Scholar 

  13. Chen, A.: Cross-Language Retrieval Experiments at CLEF 2002. In this volume 72

    Google Scholar 

  14. Buckley, C., Singhal, A., Mitra, M., Salton, G.: New Retrieval Approaches Using SMART. In Proceedings TREC-4. NIST, Gaithersburg (1996) 25-48 73, 74

    Google Scholar 

  15. Singhal, A., Choi, J., Hindle, D., Lewis, D.D., Pereira, F.: AT&T at TREC-7. In Proceedings TREC-7. NIST, Gaithersburg (1999) 239-251 73

    Google Scholar 

  16. Braschler, M., Peters, C.: CLEF 2002: Methodology and Metrics. In this volume 73

    Google Scholar 

  17. Brand, R., Brünner, M.: Océ at CLEF 2002. In this volume 73

    Google Scholar 

  18. McNamee, P., Mayfield, J., Piatko, C.: A Language-Independent Approach to European Text Retrieval. In: Peters, C. (ed.): Cross-Language Information Retrieval and Evaluation. Lecture Notes in Computer Science, Vol. 2069. Springer-Verlag, Berlin Heidelberg New York (2001) 131–139 73

    Google Scholar 

  19. McNamee, P., Mayfield, J.: JHU/APL Experiments at CLEF: Translation Resources and Score Normalization. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 193–208 73

    Google Scholar 

  20. Savoy, J.: Cross-Language Information Retrieval: Experiments Based on CLEF-2000 Corpora. Information Processing & Management (2002) to appear 74, 82

    Google Scholar 

  21. Amati, G., Carpineto, C., Romano, G.: Italian Monolingual Information Retrieval with PROSIT. In this volume 74

    Google Scholar 

  22. Nie, J. Y., Simard, M.: Using Statistical Translation Models for Bilingual IR. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.): Evaluation of Cross-Language Information Retrieval Systems. Lecture Notes in Computer Science, Vol. 2406. Springer-Verlag, Berlin Heidelberg New York (2002) 137–150 77

    Google Scholar 

  23. Gachot, D.A., Lange, E., Yang, J.: The SYSTRAN NLP Browser: An Application of Machine Translation Technology. In: Grefenstette, G. (ed.): Cross-Language Information Retrieval. Kluwer, Boston (1998) 105–118

    Chapter  Google Scholar 

  24. Voorhees, E.M., Gupta, N. K., Johnson-Laird, B.: The Collection Fusion Problem. In Proceedings of TREC-3. NIST, Gaithersburg (1995) 95-104 81

    Google Scholar 

  25. Kwok, K.L., Grunfeld, L., Lewis, D.D.: TREC-3 Ad-hoc, Routing Retrieval and Thresholding Experiments Using PIRCS. In Proceedings of TREC-3. NIST, Gaithersburg (1995) 247-255 82

    Google Scholar 

  26. Dumais, S.T.: Latent Semantic Indexing (LSI) and TREC-2. In Proceedings of TREC-2. NIST, Gaithersburg (1994) 105-115 82

    Google Scholar 

  27. Powell, A.L., French, J.C., Callan, J., Connell, M., Viles, C.L.: The Impact of Database Selection on Distributed Searching. In Proceedings of ACM-SIGIR’2000. The ACM Press, New York (2000) 232–239 82

    Google Scholar 

  28. Flury, B.: A First Course in Multivariate Statistics. Springer, New York (1997) 82

    Book  MATH  Google Scholar 

  29. Hosmer, D. W., Lemeshow, S.: Applied Logistic Regression. 2nd edn. John Wiley, New York (2000) 82

    Book  MATH  Google Scholar 

  30. Le Calvé, A., Savoy, J.: Database Merging Strategy Based on Logistic Regression. Information Processing & Management, 36 (2000) 341–359 82

    Article  Google Scholar 

  31. Venables, W.N., Ripley, B.D.: Modern Applied Statistics with S-PLUS. Springer, New York (1999) 82

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Savoy, J. (2003). Report on CLEF 2002 Experiments: Combining Multiple Sources of Evidence. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds) Advances in Cross-Language Information Retrieval. CLEF 2002. Lecture Notes in Computer Science, vol 2785. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45237-9_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-45237-9_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40830-7

  • Online ISBN: 978-3-540-45237-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics