Abstract
Our hypothesis is that assessing the relevance of a document with respect to a query is equivalent to assessing the conceptual similarity between the terms of the query and those of the document. In this article, we therefore propose a method of calculating conceptual similarity. Our information retrieval strategy is based on exploring an ontology and domain relations between concepts marked by verbal forms. Our approach overall is implemented by a prototype and the results obtained are evaluated. We thus show that a semantic IR system based on concepts improves recall with respect to a classic IR system and that a semantic IR system based on concepts and domain relations improves precision with respect to IR based on concepts alone.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Aussenac-Gilles, N., Kamel, M., Buscaldi, D., Comparot, C.: Construction dontologie partir d’une collection de pages web structures. In: Troncy, R. (ed.) Actes des Journées Francophones d’ingénierie des connaissances, IC 2013, Lille, France, pp. 1–16. AFIA (to appear, July 2013)
Bannour, I., Zargayouna, H.: Une plate-forme open-source de recherche d’information sémantique. In: CORIA 2012, Bordeaux, France, pp. 167–178 (2012)
Cimiano, P., Buitelaar, P., McCrae, J., Sintek, M.: Lexinfo: A declarative model for the lexicon-ontology interface. Web Semant. 9(1), 29–51 (2011)
Dudognon, D., Hubert, G., Ralalason, B.J.V.: ProxiGénéa: Une mesure de similarité conceptuelle. In: Colloque Veille Stratégique Scientifique et Technologique (VSST), October 2010, Université Paul Sabatier - Toulouse (2010) (support électronique), http://www.ups-tlse.fr
Dumais, S.T.: Latent semantic analysis. Annual Review of Information Science and Technology 38(1), 188–230 (2004)
Egozi, O., Markovitch, S., Gabrilovich, E.: Concept-based information retrieval using explicit semantic analysis. ACM Trans. Inf. Syst. 29(2), 8:1–8:34 (2011)
Fernández, M., Cantador, I., Lopez, V., Vallet, D., Castells, P., Motta, E.: Semantically enhanced information retrieval: An ontology-based approach. J. Web Sem. 9(4), 434–452 (2011)
Gabrilovich, E.: Feature generation for textual information retrieval using world knowledge. SIGIR Forum. 41(2), 123–123 (2007)
Jiang, J.J., Conrath, D.W.: Semantic similarity based on corpus statistics and lexical taxonomy. In: Proc. of the Int’l. Conf. on Research in Computational Linguistics, pp. 19–33 (1997)
McCrae, J., Spohr, D., Cimiano, P.: Linking lexical resources and ontologies on the semantic web with lemon. In: Antoniou, G., Grobelnik, M., Simperl, E., Parsia, B., Plexousakis, D., De Leenheer, P., Pan, J. (eds.) ESWC 2011, Part I. LNCS, vol. 6643, pp. 245–259. Springer, Heidelberg (2011)
Popov, B., Kiryakov, A., Ognyanoff, D., Manov, D., Kirilov, A.: Kim - a semantic platform for information extraction and retrieval. Natural Language Engineering 10(3-4), 375–392 (2004)
Resnik, P.: Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. (JAIR) 11, 95–130 (1999)
Roche, C., Calberg-Challot, M., Damas, L., Rouard, P.: Ontoterminology: A new paradigm for terminology. In: International Conference on Knowledge Engineering and Ontology Development, Madeira, Portugal, pp. 321–326 (2009)
Sanderson, M., Paramita, M.L., Clough, P., Kanoulas, E.: Do user preferences and evaluation measures line up? In: SIGIR, pp. 555–562 (2010)
Wu, Z., Palmer, M.: Verbs semantics and lexical selection. In: Proceedings of the 32nd Annual Meeting on Association for Computational Linguistics, ACL 1994, pp. 133–138. Association for Computational Linguistics, Stroudsburg (1994)
Zhong, M., Huang, X.: Concept-based biomedical text retrieval. In: Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2006, pp. 723–724. ACM, New York (2006)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Buscaldi, D., Bessagnet, MN., Royer, A., Sallaberry, C. (2014). Using the Semantics of Texts for Information Retrieval: A Concept- and Domain Relation-Based Approach. In: Catania, B., et al. New Trends in Databases and Information Systems. Advances in Intelligent Systems and Computing, vol 241. Springer, Cham. https://doi.org/10.1007/978-3-319-01863-8_28
Download citation
DOI: https://doi.org/10.1007/978-3-319-01863-8_28
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-01862-1
Online ISBN: 978-3-319-01863-8
eBook Packages: EngineeringEngineering (R0)