Abstract
We report our web-based query generation experiments for English and French collections in the Robust task of the CLEF Ad-Hoc track. We continued with the approach adopted in the previous year, although the model has been modified. Last year we used Google to expand the original query. This year we create a new expanded query in addition to the original one. Thus, we retrieve two lists of relevant documents, one for each query (the original and the expanded one). In order to integrate the two lists of documents, we apply a logistic regression merging solution. The results obtained are discouraging but the failure analysis shows that very difficult queries are improved by using both queries instead of the original query. The problem is to decide when a query is very difficult.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Kwok, K.L., Grunfeld, L., Lewis, D.D.: TREC-3 ad-hoc, routing retrieval and thresholding experiments using PIRCS. In: Proceedings of TREC’3, vol. 500-215, pp. 247–255. NIST Special Publication (1995)
Martínez-Santiago, F., Montejo-Ráez, A., García-Cumbreras, M.A., Ureña-López, L.A.: SINAI at CLEF 2006 Ad Hoc Robust Multilingual Track: Query Expansion using the Google Search Engine Evaluation of Multilingual and Multi-modal Information Retrieval. In: Peters, C., Clough, P., Gey, F.C., Karlgren, J., Magnini, B., Oard, D.W., de Rijke, M., Stempfhuber, M. (eds.) CLEF 2006. LNCS, vol. 4730. Springer, Heidelberg (2007)
Voorhees, E., Gupta, N.K., Johnson-Laird, B.: The Collection Fusion Problem. In: Proceedings of the 3th Text Retrieval Conference TREC-3, vol. 500-225, pp. 95–104. NIST Special Publication (1995)
Martínez Santiago, F., Ureña López, L.A., Martín-Valdivia, M.T.: A merging strategy proposal: The 2-step retrieval status value method. Information Retrieval 9, 71–93 (2006)
Savoy, J.: Combining Multiple Strategies for Effective Cross-Language Retrieval. Information Retrieval 7, 121–148 (2004)
Robertson, S.E., Walker, S.: Okapi-Keenbow at TREC-8. In: Proceedings of the 8th Text Retrieval Conference TREC-8, vol. 500-246, pp. 151–162. NIST Special Publication (1999)
Calvé, A., Savoy, J.: Database merging strategy based on logistic regression. Information Processing & Management 36, 341–359 (2000)
Savoy, J.: Cross-Language information retrieval: experiments based on CLEF 2000 corpora. Information Processing & Management 39, 75–115 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Martínez-Santiago, F., Montejo-Ráez, A., García-Cumbreras, M.A. (2008). SINAI at CLEF Ad-Hoc Robust Track 2007: Applying Google Search Engine for Robust Cross-Lingual Retrieval. In: Peters, C., et al. Advances in Multilingual and Multimodal Information Retrieval. CLEF 2007. Lecture Notes in Computer Science, vol 5152. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85760-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-540-85760-0_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-85759-4
Online ISBN: 978-3-540-85760-0
eBook Packages: Computer ScienceComputer Science (R0)