Skip to main content

SINAI at CLEF 2006 Ad Hoc Robust Multilingual Track: Query Expansion Using the Google Search Engine

  • Conference paper
Book cover Evaluation of Multilingual and Multi-modal Information Retrieval (CLEF 2006)

Abstract

This year, we have participated in the Ad-Hoc Robust Multilingual track with the aim of evaluating two important issues in Cross-Lingual Information Retrieval (CLIR) systems. This paper first describes the method applied for query expansion in a multilingual environment by using web search results provided by the Google engine in order to increase retrieval robustness. Unfortunately, the results obtained are disappointing. The second issue reported alludes to the robustness of several common merging algorithms. We have found that 2-step RSV merging algorithms perform better than others algorithms when evaluating using geometric average.

This work has been supported by the Spanish Government (MCYT) with grant TIC2003-07158-C04-04 and the RFC/PP2006/Id_514 granted by the University of Jaén.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Voorhees, E.M.: The TREC Robust Retrieval Track, TREC Report (2005)

    Google Scholar 

  2. Kwok, K.L., Grunfeld, L., Deng, P.: Improving Weak Ad-Hoc Retrieval by Web Assistance and Data Fusion. In: Lee, G.G., Yamada, A., Meng, H., Myaeng, S.-H. (eds.) AIRS 2005. LNCS, vol. 3689, pp. 17–30. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  3. Kwok, K.L., Grunfeld, L., Sun, H.L., Deng, P.: TREC 2004 Robust Track Experiments using PIRCS, 2004 (2005)

    Google Scholar 

  4. Grunfeld, L., Kwok, K.L., Dinstl, N., Deng, P.: TREC 2003 Robust, HARD and QA Track Experiments using PIRCS (2003)

    Google Scholar 

  5. Dumais, S.T.: Latent Semantic Indexing (LSI) and TREC-2. In: Harman, D.K. (ed.) Proceedings of TREC’2, Gaithersburg. NIST, vol. 500-215, pp. 105–115 (1994)

    Google Scholar 

  6. Martinez-Santiago, F., Ureña, L.A., Martin, M.: A merging strategy proposal: two step retrieval status value method. Information Retrieval 9(1), 71–93 (2006)

    Article  Google Scholar 

  7. Porter, M.F.: An algorithm for suffix stripping. Program 14, 130–137 (1980)

    Google Scholar 

  8. Robertson, S.E, Walker, S., Beaulieu, M.: Experimentation as a way of life: Okapi at TREC. Information Processing and Management 1, 95–108 (2000)

    Article  Google Scholar 

  9. Savoy, J.: Cross-Language Information Retrieval: experiments based on CLEF 2000 corpora. Information Processing and Management 39, 75–115 (2003)

    Article  MATH  Google Scholar 

  10. Llopis, F., Garcia Puigcerver, H., Cano, M., Toral, A., Espi, H.: IR-n System, a Passage Retrieval Architecture. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 57–64. Springer, Heidelberg (2004)

    Google Scholar 

  11. Callan, J.P., Lu, Z., Croft, W.B.: Searching distributed collections with inference networks. In: Proceedings of the 18th International Conference of the ACM SIGIR 1995, pp. 21–28. The ACM Press, New York (1995)

    Google Scholar 

  12. Calve, A., Savoy, J.: Database merging strategy based on logistic regression. Information Processing and Management 36, 341–359 (2000)

    Article  Google Scholar 

  13. Powell, A.L., French, J.C., Callan, J., Connell, M., Viles, C.L.: The impact of database selection on distributed searching. In: Proceedings of the 23rd International Conference of the ACM-SIGIR 2000, pp. 232–239. ACM Press, New York (2000)

    Google Scholar 

  14. Voorhees, E., Gupta, N.K., Johnson-Laird, B.: The collection fusion problem. In: Harman, D.K. (ed.) Proceedings of the 3rd Text Retrieval Conference TREC-3, National Institute of Standards ad Technology, Special Publication, vol. 500-225, pp. 95–104 (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Carol Peters Paul Clough Fredric C. Gey Jussi Karlgren Bernardo Magnini Douglas W. Oard Maarten de Rijke Maximilian Stempfhuber

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Martínez-Santiago, F., Montejo-Ráez, A., García-Cumbreras, M.Á., Ureña-López, L.A. (2007). SINAI at CLEF 2006 Ad Hoc Robust Multilingual Track: Query Expansion Using the Google Search Engine. In: Peters, C., et al. Evaluation of Multilingual and Multi-modal Information Retrieval. CLEF 2006. Lecture Notes in Computer Science, vol 4730. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74999-8_16

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74999-8_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74998-1

  • Online ISBN: 978-3-540-74999-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics