ENSM-SE at INEX 2009 : Scoring with Proximity and Semantic Tag Information

Beigbeder, Michel; Imafouo, Amélie; Mercier, Annabelle

doi:10.1007/978-3-642-14556-8_6

Michel Beigbeder¹⁹,
Amélie Imafouo¹⁹ &
Annabelle Mercier²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6203))

Included in the following conference series:

International Workshop of the Initiative for the Evaluation of XML Retrieval

549 Accesses
2 Citations

Abstract

We present in this paper some experiments on the Wikipedia collection used in the INEX 2009 evaluation campaign with an information retrieval method based on proximity. The idea of the method is to assign to each position in the document a fuzzy proximity value depending on its closeness to the surrounding keywords. These proximity values can then be summed on any range of text – including any passage or any element – and after normalization this sum is used as the relevance score for the extent. To take into account the semantic tags, we define a contextual operator which allow to consider at query time only the occurrences of terms that appear in a given semantic context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Mitchell, P.C.: A note about the proximity operators in information retrieval. In: Proceedings of the 1973 meeting on Programming languages and information retrieval, pp. 177–180. ACM Press, New York (1974)
Google Scholar
Mitra, M., Buckley, C., Singhal, A., Cardie, C.: An analysis of statistical and syntactic phrases. In: Proceedings of RIAO 1997, 5th International Conference “Recherche d’Information Assistee par Ordinateur”, pp. 200–214 (1997)
Google Scholar
Rasolofo, Y., Savoy, J.: Term proximity scoring for keyword-based retrieval systems. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 207–218. Springer, Heidelberg (2003)
Chapter Google Scholar
Büttcher, S., Clarke, C.L.A., Lushman, B.: Term proximity scoring for ad-hoc retrieval on very large text collections. In: ACM SIGIR ’06, pp. 621–622. ACM, New York (2006)
Chapter Google Scholar
Song, R., Taylor, M.J., Wen, J.R., Hon, H.W., Yu, Y.: Viewing term proximity from a different perspective. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 346–357. Springer, Heidelberg (2008)
Chapter Google Scholar
Vechtomova, O., Karamuftuoglu, M.: Lexical cohesion and term proximity in document ranking. Information Processing and Management 44(4), 1485–1502 (2008)
Article Google Scholar
Hearst, M.A.: Improving full-text precision on short queries using simple constraints. In: Proceedings of the 5th Annual Symposium on Document Analysis and Information Retrieval (SDAIR), pp. 217–232 (1996)
Google Scholar
Clarke, C.L.A., Cormack, G.V., Burkowski, F.J.: Shortest substring ranking (multitext experiments for TREC-4). [13]
Google Scholar
Hawking, D., Thistlewaite, P.: Proximity operators - so near and yet so far. [13]
Google Scholar
Clarke, C.L.A., Cormack, G.V.: Shortest-substring retrieval and ranking. ACM Transactions on Information Systems 18(1), 44–78 (2000)
Article Google Scholar
de Kretser, O., Moffat, A.: Effective document presentation with a locality-based similarity heuristic. In: ACM SIGIR ’99, pp. 113–120. ACM, New York (1999)
Chapter Google Scholar
Beigbeder, M., Mercier, A.: An information retrieval model using the fuzzy proximity degree of term occurences. In: Liebrock, L.M. (ed.) SAC 2005: Proceedings of the 2005 ACM symposium on Applied computing. ACM Press, New York (2005)
Google Scholar
Harman, D.K. (ed.): The Fourth Text REtrieval Conference (TREC-4). Number 500-236, Department of Commerce, National Institute of Standards and Technology (1995)
Google Scholar

Download references

Author information

Authors and Affiliations

École Nationale Supérieure des Mines de Saint-Étienne, 158, cours Fauriel, F-42023, Saint Etienne Cedex 2, France
Michel Beigbeder & Amélie Imafouo
LCIS Lab - Grenoble University, 50, rue Barthelemy de Laffemas, BP 54 - F-26902, Valence Cedex 9, France
Annabelle Mercier

Authors

Michel Beigbeder
View author publications
You can also search for this author in PubMed Google Scholar
Amélie Imafouo
View author publications
You can also search for this author in PubMed Google Scholar
Annabelle Mercier
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Faculty of Science and Technology, Queensland University of Technology, GPO Box 2434, 4001, Brisbane, Qld, Australia
Shlomo Geva
Archives and Information Studies/Humanities, University of Amsterdam, Turfdraagsterpad 9, 1012 XT, Amsterdam, The Netherlands
Jaap Kamps
Department of Computer Science, University of Otago, P.O. Box 56,, 9054, Dunedin, New Zealand
Andrew Trotman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Beigbeder, M., Imafouo, A., Mercier, A. (2010). ENSM-SE at INEX 2009 : Scoring with Proximity and Semantic Tag Information. In: Geva, S., Kamps, J., Trotman, A. (eds) Focused Retrieval and Evaluation. INEX 2009. Lecture Notes in Computer Science, vol 6203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14556-8_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-14556-8_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-14555-1
Online ISBN: 978-3-642-14556-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics