Skip to main content

ENSM-SE at INEX 2009 : Scoring with Proximity and Semantic Tag Information

  • Conference paper
Focused Retrieval and Evaluation (INEX 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6203))

Abstract

We present in this paper some experiments on the Wikipedia collection used in the INEX 2009 evaluation campaign with an information retrieval method based on proximity. The idea of the method is to assign to each position in the document a fuzzy proximity value depending on its closeness to the surrounding keywords. These proximity values can then be summed on any range of text – including any passage or any element – and after normalization this sum is used as the relevance score for the extent. To take into account the semantic tags, we define a contextual operator which allow to consider at query time only the occurrences of terms that appear in a given semantic context.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mitchell, P.C.: A note about the proximity operators in information retrieval. In: Proceedings of the 1973 meeting on Programming languages and information retrieval, pp. 177–180. ACM Press, New York (1974)

    Google Scholar 

  2. Mitra, M., Buckley, C., Singhal, A., Cardie, C.: An analysis of statistical and syntactic phrases. In: Proceedings of RIAO 1997, 5th International Conference “Recherche d’Information Assistee par Ordinateur”, pp. 200–214 (1997)

    Google Scholar 

  3. Rasolofo, Y., Savoy, J.: Term proximity scoring for keyword-based retrieval systems. In: Sebastiani, F. (ed.) ECIR 2003. LNCS, vol. 2633, pp. 207–218. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  4. Büttcher, S., Clarke, C.L.A., Lushman, B.: Term proximity scoring for ad-hoc retrieval on very large text collections. In: ACM SIGIR ’06, pp. 621–622. ACM, New York (2006)

    Chapter  Google Scholar 

  5. Song, R., Taylor, M.J., Wen, J.R., Hon, H.W., Yu, Y.: Viewing term proximity from a different perspective. In: Macdonald, C., Ounis, I., Plachouras, V., Ruthven, I., White, R.W. (eds.) ECIR 2008. LNCS, vol. 4956, pp. 346–357. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  6. Vechtomova, O., Karamuftuoglu, M.: Lexical cohesion and term proximity in document ranking. Information Processing and Management 44(4), 1485–1502 (2008)

    Article  Google Scholar 

  7. Hearst, M.A.: Improving full-text precision on short queries using simple constraints. In: Proceedings of the 5th Annual Symposium on Document Analysis and Information Retrieval (SDAIR), pp. 217–232 (1996)

    Google Scholar 

  8. Clarke, C.L.A., Cormack, G.V., Burkowski, F.J.: Shortest substring ranking (multitext experiments for TREC-4). [13]

    Google Scholar 

  9. Hawking, D., Thistlewaite, P.: Proximity operators - so near and yet so far. [13]

    Google Scholar 

  10. Clarke, C.L.A., Cormack, G.V.: Shortest-substring retrieval and ranking. ACM Transactions on Information Systems 18(1), 44–78 (2000)

    Article  Google Scholar 

  11. de Kretser, O., Moffat, A.: Effective document presentation with a locality-based similarity heuristic. In: ACM SIGIR ’99, pp. 113–120. ACM, New York (1999)

    Chapter  Google Scholar 

  12. Beigbeder, M., Mercier, A.: An information retrieval model using the fuzzy proximity degree of term occurences. In: Liebrock, L.M. (ed.) SAC 2005: Proceedings of the 2005 ACM symposium on Applied computing. ACM Press, New York (2005)

    Google Scholar 

  13. Harman, D.K. (ed.): The Fourth Text REtrieval Conference (TREC-4). Number 500-236, Department of Commerce, National Institute of Standards and Technology (1995)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Beigbeder, M., Imafouo, A., Mercier, A. (2010). ENSM-SE at INEX 2009 : Scoring with Proximity and Semantic Tag Information. In: Geva, S., Kamps, J., Trotman, A. (eds) Focused Retrieval and Evaluation. INEX 2009. Lecture Notes in Computer Science, vol 6203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-14556-8_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-14556-8_6

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-14555-1

  • Online ISBN: 978-3-642-14556-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics