Advertisement

Searching XML Element Using Terms Propagation Method

  • Samia Berchiche-Fellag
  • Mohamed Mezghiche
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8502)

Abstract

In this paper, we describe terms propagation method dealing with focussed XML component retrieval. Focussed XML component retrieval is one of the most important challenge in the XML IR field. The aim of the focussed retrieval approach is to find the most exhaustive and specific element that focus on the user need. These needs can be expressed through content queries composed of simple keyword. Our method provides a natural representation of document, its elements and its content, and allows an automatic selection of a combination of elements that better answers the user’s query. In this paper we show the efficiency of the terms propagation method using a terms weighting formula that takes into account the size of the nodes and the size of the document. Our method has been evaluated on the «Focused» task of INEX 2006 and compared to XFIRM model which is based on relevance propagation method. Evaluations have shown a significant improvement in the retrieval process efficiency.

Keywords

Structured Information Retrieval (SIR) XML terms propagation CO query terms weighting element INEX 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Lalmas, M.: Dempster-Shafer’s theory of evidence applied to structured documents: Modeling uncertainty. In: Proceedings of ACM-SIGIR, Philadelphia, pp. 110–118 (1997)Google Scholar
  2. 2.
    Mass, Y., Mandelbrod, M.: Retrieving the most relevant XML Component. In: Proceedings of the Second Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), December 15-17 (2003)Google Scholar
  3. 3.
    Mass, Y., Mandelbrod, M.: Component Ranking and Automatic Query Refinement for XML Retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 73–84. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  4. 4.
    Mass, Y., Mandelbrod, M.: Using the INEX Environment as a Test Bed for various User Models for XML Retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 187–195. Springer, Heidelberg (2006)Google Scholar
  5. 5.
    Berchiche-Fellag, S., Mezghiche, M.: XML Element Retrieval using terms propagation. In: International Conference on Automation, Control, Engineering and Computer Science, ACECS 2014 (2014) (to be published)Google Scholar
  6. 6.
    Grabs, T., Scheck, H.J.: Flexible information retrieval from XML with Power DB XML. In: Proceedings of the First Annual Workshop of INEX, pp. 141–148 (December 2002)Google Scholar
  7. 7.
    Kakade, V., Raghavan, P.: Encoding XML in vector spaces. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 96–111. Springer, Heidelberg (2005)CrossRefGoogle Scholar
  8. 8.
    Fuhr, N., Malik, S., Lalmas, M.: Overview of the initiative for the evaluation of XML retrieval (INEX) 2003. In: Proceedings of INEX 2003 Workshop, Dagstuhl, Germany (December 2003)Google Scholar
  9. 9.
    Ogilvie, P., Callan, J.: Using language models for flat text queries in XML retrieval. In: Proceedings of INEX 2003 Workshop, Dagstuhl, Germany, pp. 12–18 (December 2003)Google Scholar
  10. 10.
    Kamps, J., Rijke, M., Sigurbjornsson, B.: Length normalization in XML retrieval. In: Proceedings of SIGIR 2004, Sheffield, England, pp. 80–87 (2004)Google Scholar
  11. 11.
    Piwowarski, B., Faure, G.E., Gallinari, P.: Bayesian Networks and INEX. In: Proceeding in the First Annual Workshop for the Evaluation of Xml Retrieval, INEX (2002)Google Scholar
  12. 12.
    Anh, V.N., Moffat, A.: Compression and an IR approach to XML Retrieval. In: INEX 2002 Workshop Proceedings, Germany, pp. 100–104 (2002)Google Scholar
  13. 13.
    Sauvagnat, K.: Modèle flexible pour la recherche d’information dans des corpus de documents semi-structurés. Thèse Doctorat, Université Paul Sabatier de Toulouse (2005)Google Scholar
  14. 14.
    Fuhr, N., Grossjohann, K.: XIRQL, a query language for information retrieval in XML documents. In: Proceedings of SIGIR 2001, Toronto, Canada (2001)Google Scholar
  15. 15.
    Gövert, N., Abolhassanni, M., Fuhr, N., Grossjohann, K.: Content-Oriented XML Retrieval with HyreX. In: INEX 2002 Workshop Proceedings, Germany, pp. 26–32 (2002)Google Scholar
  16. 16.
    Cui, H., Wen, J.-R., Chua, J.-R.: Hierarchical indexing and flexible element retrieval for structured document (April 2003)Google Scholar
  17. 17.
    Ben Aouicha, M.: Une approche algébrique pour la recherche d’information structurée. Thèse de doctorat en informatique, Université Paul Sabatier, Toulouse (2009)Google Scholar
  18. 18.
    Berchiche-Fellag, S., Boughanem, M.: Traitement des requêtes CO (Content Only) sur un corpus de documents XML. In: Colloque sur l’Optimisation et les Systèmes d’Information (2010)Google Scholar
  19. 19.
    Denoyer, L., Gallinari, P.: The Wikipedia XML corpus. SIGIR Forum 40(1), 64–69 (2006)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Samia Berchiche-Fellag
    • 1
  • Mohamed Mezghiche
    • 2
  1. 1.Université Mouloud Mammeri de Tizi-OuzouTizi-OuzouAlgérie
  2. 2.Université M’Hamed Bougara BoumerdesAlgérie

Personalised recommendations