SENSE: SEmantic N-levels Search Engine at CLEF2008 Ad Hoc Robust-WSD Track

  • Annalina Caputo
  • Pierpaolo Basile
  • Giovanni Semeraro
Part of the Lecture Notes in Computer Science book series (LNCS, volume 5706)

Abstract

This paper presents the results of the experiments conducted at the University of Bari for the Ad Hoc Robust-WSD track of the Cross-Language Evaluation Forum (CLEF) 2008. The evaluation was performed using SENSE (SEmantic N-levels Search Engine), a semantic search engine that tries to overcome the limitations of the ranked keyword approach by introducing semantic levels, which integrate (and not simply replace) the lexical level represented by keywords.

We show how SENSE manages documents indexed at two separate levels, keyword and word meaning, in an attempt to improve retrieval performance.

Two types of experiments were performed: one exploiting a single indexing level, the other exploiting all indexing levels at the same time. The experiments that combine keywords and word meanings, extracted from the WordNet lexical database, show the promise of the idea and point out the value of our intuition.

In particular, the results confirm our hypothesis: the combination of two indexing levels outperforms a single level. Indeed, the N-levels model improves precision by 35% over the results obtained with the indexing level based only on keywords.
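
The abstract does not say how SENSE merges the result lists produced by its indexing levels. As an illustration only, the sketch below shows one common way to fuse two ranked lists: each level's scores are min-max normalized and then summed with weights, in the style of CombSUM. The function names, the weights, and the document scores are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict


def min_max_normalize(scores):
    """Rescale one level's scores to [0, 1] so that levels are comparable."""
    if not scores:
        return {}
    low, high = min(scores.values()), max(scores.values())
    if high == low:
        # All documents scored the same: treat them as equally relevant.
        return {doc: 1.0 for doc in scores}
    return {doc: (s - low) / (high - low) for doc, s in scores.items()}


def combine_levels(level_scores, weights):
    """Weighted CombSUM-style fusion of per-level result lists.

    level_scores maps a level name to {doc_id: raw_score};
    weights maps a level name to its weight (default 1.0).
    This is an assumed fusion scheme, not the one used by SENSE.
    """
    fused = defaultdict(float)
    for level, scores in level_scores.items():
        for doc, s in min_max_normalize(scores).items():
            fused[doc] += weights.get(level, 1.0) * s
    # Highest fused score first; ties broken by document id.
    return sorted(fused.items(), key=lambda item: (-item[1], item[0]))


# Hypothetical scores returned by a keyword index and a word-meaning index.
keyword_hits = {"d1": 2.0, "d2": 1.0, "d3": 0.5}
meaning_hits = {"d2": 0.9, "d4": 0.6, "d1": 0.3}

ranking = combine_levels(
    {"keyword": keyword_hits, "meaning": meaning_hits},
    weights={"keyword": 0.5, "meaning": 0.5},
)
for doc, score in ranking:
    print(doc, round(score, 3))
```

In this toy example a document that scores well at both levels (d2) rises above one that is strong only at the keyword level (d1), which is the kind of effect a multi-level combination is meant to capture.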

Keywords

Query Expansion, Information Retrieval System, Indexing Level, Word Sense Disambiguation, Spanish Word
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Annalina Caputo (1)
  • Pierpaolo Basile (1)
  • Giovanni Semeraro (1)

  1. Department of Computer Science, University of Bari, Bari, Italy