Skip to main content

Structural Feedback for Keyword-Based XML Retrieval

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3936))

Abstract

Keyword-based queries are an important means to retrieve information from XML collections with unknown or complex schemas. Relevance Feedback integrates relevance information provided by a user to enhance retrieval quality. For keyword-based XML queries, feedback engines usually generate an expanded keyword query from the content of elements marked as relevant or nonrelevant. This approach that is inspired by text-based IR completely ignores the semistructured nature of XML. This paper makes the important step from pure content-based to structural feedback. It presents a framework that expands a keyword query into a full-fledged content-and-structure query. Extensive experiments with the established INEX benchmark and our TopX search engine show the feasibility of our approach.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amati, G., Carpineto, C., Romano, G.: Merging XML indices. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 253–260. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  2. Balmin, A., et al.: A system for keyword proximity search on XML databases. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 1069–1072. Springer, Heidelberg (2004)

    Google Scholar 

  3. Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.): Intelligent Search on XML Data. LNCS, vol. 2818. Springer, Heidelberg (2003)

    MATH  Google Scholar 

  4. Crouch, C.: Relevance feedback at the INEX 2004 workshop. In: INEX 2004 (2005)

    Google Scholar 

  5. Crouch, C.J., Mahajan, A., Bellamkonda, A.: Flexible XML retrieval based on the extended vector model. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 149–153. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  6. Efthimiadis, E.N.: A user-centred evaluation of ranking algorithms for interactive query expansion. In: SIGIR Forum (USA), special issue, pp. 146–159 (1993)

    Google Scholar 

  7. Grust, T.: Accelerating XPath location steps. In: SIGMOD 2002, pp. 109–120 (2002)

    Google Scholar 

  8. Guo, L., et al.: XRANK: ranked keyword search over XML documents. In: SIGMOD 2003, pp. 16–27 (2003)

    Google Scholar 

  9. Hlaoua, L., Boughanem, M.: Towards context and structural relevance feedback in XML retrieval. In: Workshop on Open Source Web Information Retrieval, OSWIR (2005), http://www.emse.fr/OSWIR05/

  10. Hsu, W., Lee, M.L., Wu, X.: Path-augmented keyword search for XML documents. In: ICTAI 2004, pp. 526–530 (2004)

    Google Scholar 

  11. INEX relevance feedback track, http://inex.is.informatik.uni-duisburg.de:2004/tracks/rel/

  12. Kazai, G., et al.: The INEX evaluation initiative. In: Blanken, et al. (eds.) [3], pp. 279–293

    Google Scholar 

  13. Liu, S., Zou, Q., Chu, W.: Configurable indexing and ranking for XML information retrieval. In: SIGIR 2004, pp. 88–95 (2004)

    Google Scholar 

  14. Mass, Y., Mandelbrod, M.: Relevance feedback for XML retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 154–157. Springer, Heidelberg (2005)

    Google Scholar 

  15. Mihajlovic̀, V., et al.: TIJAH at INEX 2004 modeling phrases and relevance feedback. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 141–148. Springer, Heidelberg (2005)

    Google Scholar 

  16. Pan, H., Theobald, A., Schenkel, R.: Query refinement by relevance feedback in an XML retrieval system. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 854–855. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  17. Porter, M., Galpin, V.: Relevance fedback in a public access catalogue for a research library: Muscat at the Scott Polar Research Institute. Program 22(1), 1–20 (1988)

    Article  Google Scholar 

  18. Ramirez, G., Westerveld, T., de Vries, A.: Structural features in content oriented xml retrieval. Technical Report INS-E0508, CWI, Centre for Mathematics and Computer Science (2005)

    Google Scholar 

  19. Ramírez, G., Westerveld, T., de Vries, A.P.: Structural features in content oriented XML retrieval. In: CIKM 2005 (2005)

    Google Scholar 

  20. Robertson, S.: On term selection for query expansion. Journal of Documentation 46, 359–364 (1990)

    Article  Google Scholar 

  21. Robertson, S., Sparck-Jones, K.: Relevance weighting of search terms. Journal of the American Society of Information Science 27, 129–146 (1976)

    Article  Google Scholar 

  22. Rocchio Jr., J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, ch. 14, pp. 313–323. Prentice Hall, Englewood Cliffs (1971)

    Google Scholar 

  23. Ruthven, I., Lalmas, M.: A survey on the use of relevance feedback for information access systems. Knowledge Engineering Review 18(1) (2003)

    Google Scholar 

  24. Sigurbjörnsson, B., Kamps, J., de Rijke, M.: The University of Amsterdam at INEX 2004. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 104–109. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  25. Theobald, M., Schenkel, R., Weikum, G.: An efficient and versatile query engine for TopX search. In: VLDB 2005, pp. 625–636 (2005)

    Google Scholar 

  26. Theobald, M., Schenkel, R., Weikum, G.: TopX & XXL at INEX 2005. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 282–295. Springer, Heidelberg (2006)

    Google Scholar 

  27. Trotman, A., Sigurbjörnsson, B.: Narrowed Extended XPath I, NEXI (2004), Available at http://www.cs.otago.ac.nz/postgrads/andrew/2004-4.pdf

  28. van Rijsbergen, C., Harper, D., Porter, M.: The selection of good search terms. Information Processing and Management 17(2), 77–91 (1981)

    Article  Google Scholar 

  29. Vittaut, J.-N., Piwowarski, B., Gallinari, P.: An algebra for structured queries in bayesian networks. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 58–64. Springer, Heidelberg (2005)

    Chapter  Google Scholar 

  30. Weber, R.: Using relevance feedback in XML retrieval. In: Blanken, et al. (eds.) [3], pp. 133–143

    Google Scholar 

  31. Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest LCAs in XML databases. In: SIGMOD 2005, pp. 537–538 (2005)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schenkel, R., Theobald, M. (2006). Structural Feedback for Keyword-Based XML Retrieval. In: Lalmas, M., MacFarlane, A., Rüger, S., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds) Advances in Information Retrieval. ECIR 2006. Lecture Notes in Computer Science, vol 3936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11735106_29

Download citation

  • DOI: https://doi.org/10.1007/11735106_29

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33347-0

  • Online ISBN: 978-3-540-33348-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics