Revisiting the Term Frequency in Concept-Based IR Models

Abdulahhad, Karam; Chevallet, Jean-Pierre; Berrut, Catherine

doi:10.1007/978-3-642-40285-2_8

Karam Abdulahhad²¹,
Jean-Pierre Chevallet²² &
Catherine Berrut²³

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8055))

Included in the following conference series:

International Conference on Database and Expert Systems Applications

1759 Accesses
1 Citations

Abstract

Indexing documents and queries using concepts, instead of word-based indexing, is an alternative approach, and it supposes to give a more meaningful indexing. However, this way of indexing needs to revisit some hypotheses of classical Information Retrieval. Therefore, we propose a new concept weighting approach, namely Relative Weight, which weights concepts with respect to their corresponding text in the documents or queries. In other words, it assigns to each concept a relative weight with respect to the other concepts in the same context. We explore interesting experimental results of our new weighting approach, compared to the classical approaches, through studying the retrieval performance of some classical IR models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Abdulahhad, K., Chevallet, J.-P., Berrut, C.: Solving Concept mismatch through Bayesian Framework by Extending UMLS Meta-Thesaurus. In: CORIA 2011, Avignon, France, pp. 311–326 (March 2011)
Google Scholar
Aronson, A.R.: Metamap: Mapping text to the umls metathesaurus (2006)
Google Scholar
Baziz, M.: Indexation conceptuelle guidée par ontologie pour la recherche d’information. Thèse de doctorat, Université Paul Sabatier, Toulouse, France (Décembre 2005)
Google Scholar
Bendersky, M., Metzler, D., Bruce Croft, W.: Parameterized concept weighting in verbose queries. In: SIGIR 2011, Beijing, China, pp. 605–614 (2011)
Google Scholar
Chevallet, J.-P., Lim, J.-H., Le., D.T.H.: Domain knowledge conceptual inter-media indexing: application to multilingual multimedia medical reports. In: CIKM 2007, Lisbon, Portugal, pp. 495–504 (2007)
Google Scholar
Codocedo, V., Lykourentzou, I., Napoli, A.: Semantic Indexing and Retrieval based on Formal Concept Analysis. Technical report (June 2012)
Google Scholar
Crestani, F.: Exploiting the similarity of non-matching terms at retrieval time. Inf. Retr. 2(1), 27–47 (2000)
Article Google Scholar
Dozier, C., Kondadadi, R., Al-Kofahi, K., Chaudhary, M., Guo, X.S.: Fast tagging of medical terms in legal text. In: ICAIL, pp. 253–260 (2007)
Google Scholar
Fang, H., Tao, T., Zhai, C.: A formal study of information retrieval heuristics. In: SIGIR 2004, Sheffield, United Kingdom, pp. 49–56 (2004)
Google Scholar
Le., T.H.D.: Utilisation de ressources externes dans un modèle Bayésien de Recherche d’Information. Application à la recherche d’information multilingue avec UMLS. These, Université Joseph-Fourier - Grenoble I (May 2009)
Google Scholar
Luhn, H.P.: The automatic creation of literature abstracts. IBM J. Res. Dev. 2(2), 159–165 (1958)
Article MathSciNet Google Scholar
Maisonnasse, L.: Les supports de vocabulaires pour les systèmes de recherche d’information orientés précision: application aux graphes pour la recherche d’information médicale. These, Université Joseph-Fourier - Grenoble I (May 2008)
Google Scholar
Ren, F., Bracewell, D.B.: Advanced information retrieval. Electron. Notes Theor. Comput. Sci. 225, 303–317 (2009)
Article Google Scholar
Robertson, S.E., Walker, S.: Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In: SIGIR 1994, Dublin, Ireland, pp. 232–241 (1994)
Google Scholar
Sanderson, M.: Word sense disambiguation and information retrieval. In: SIGIR 1994, Dublin, Ireland, pp. 142–151 (1994)
Google Scholar
Singhal, A., Buckley, C., Mitra, M.: Pivoted document length normalization. In: SIGIR 1996, Zurich, Switzerland, pp. 21–29 (1996)
Google Scholar
Smucker, M.D., Allan, J., Carterette, B.: A comparison of statistical significance tests for information retrieval evaluation. In: CIKM 2007, Lisbon, Portugal, pp. 623–632 (2007)
Google Scholar
Zhai, C., Lafferty, J.: A study of smoothing methods for language models applied to ad hoc information retrieval. In: SIGIR 2001, New Orleans, Louisiana, United States, pp. 334–342 (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Université de Grenoble, France
Karam Abdulahhad
UPMF-Grenoble 2, France
Jean-Pierre Chevallet
LIG Laboratory, MRIM Group, UJF-Grenoble 1, Grenoble, France
Catherine Berrut

Authors

Karam Abdulahhad
View author publications
You can also search for this author in PubMed Google Scholar
Jean-Pierre Chevallet
View author publications
You can also search for this author in PubMed Google Scholar
Catherine Berrut
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Instituto Tecnológico de Informática, Valencia, Spain
Hendrik Decker
Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University in Prague, 166 27, Prague 6, Czech Republic
Lenka Lhotská
Department of Computer Science, The University of Auckland, 1010, Auckland, New Zealand
Sebastian Link
Department of Information Technologies, University of Economics, Winston Churchill Square 4, 130 67, Prague 3, Czech Republic
Josef Basl
Institute of Software Technology, Vienna University of Technology, Favoritenstraße 9-11 / 188, 1040, Vienna, Austria
A Min Tjoa

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Abdulahhad, K., Chevallet, JP., Berrut, C. (2013). Revisiting the Term Frequency in Concept-Based IR Models. In: Decker, H., Lhotská, L., Link, S., Basl, J., Tjoa, A.M. (eds) Database and Expert Systems Applications. DEXA 2013. Lecture Notes in Computer Science, vol 8055. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40285-2_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-40285-2_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40284-5
Online ISBN: 978-3-642-40285-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics