Abstract
Keyword-based queries are an important means to retrieve information from XML collections with unknown or complex schemas. Relevance Feedback integrates relevance information provided by a user to enhance retrieval quality. For keyword-based XML queries, feedback engines usually generate an expanded keyword query from the content of elements marked as relevant or nonrelevant. This approach that is inspired by text-based IR completely ignores the semistructured nature of XML. This paper makes the important step from pure content-based to structural feedback. It presents a framework that expands a keyword query into a full-fledged content-and-structure query. Extensive experiments with the established INEX benchmark and our TopX search engine show the feasibility of our approach.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Amati, G., Carpineto, C., Romano, G.: Merging XML indices. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 253–260. Springer, Heidelberg (2005)
Balmin, A., et al.: A system for keyword proximity search on XML databases. In: Aberer, K., Koubarakis, M., Kalogeraki, V. (eds.) VLDB 2003. LNCS, vol. 2944, pp. 1069–1072. Springer, Heidelberg (2004)
Blanken, H.M., Grabs, T., Schek, H.-J., Schenkel, R., Weikum, G. (eds.): Intelligent Search on XML Data. LNCS, vol. 2818. Springer, Heidelberg (2003)
Crouch, C.: Relevance feedback at the INEX 2004 workshop. In: INEX 2004 (2005)
Crouch, C.J., Mahajan, A., Bellamkonda, A.: Flexible XML retrieval based on the extended vector model. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 149–153. Springer, Heidelberg (2005)
Efthimiadis, E.N.: A user-centred evaluation of ranking algorithms for interactive query expansion. In: SIGIR Forum (USA), special issue, pp. 146–159 (1993)
Grust, T.: Accelerating XPath location steps. In: SIGMOD 2002, pp. 109–120 (2002)
Guo, L., et al.: XRANK: ranked keyword search over XML documents. In: SIGMOD 2003, pp. 16–27 (2003)
Hlaoua, L., Boughanem, M.: Towards context and structural relevance feedback in XML retrieval. In: Workshop on Open Source Web Information Retrieval, OSWIR (2005), http://www.emse.fr/OSWIR05/
Hsu, W., Lee, M.L., Wu, X.: Path-augmented keyword search for XML documents. In: ICTAI 2004, pp. 526–530 (2004)
INEX relevance feedback track, http://inex.is.informatik.uni-duisburg.de:2004/tracks/rel/
Kazai, G., et al.: The INEX evaluation initiative. In: Blanken, et al. (eds.) [3], pp. 279–293
Liu, S., Zou, Q., Chu, W.: Configurable indexing and ranking for XML information retrieval. In: SIGIR 2004, pp. 88–95 (2004)
Mass, Y., Mandelbrod, M.: Relevance feedback for XML retrieval. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 154–157. Springer, Heidelberg (2005)
Mihajlovic̀, V., et al.: TIJAH at INEX 2004 modeling phrases and relevance feedback. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 141–148. Springer, Heidelberg (2005)
Pan, H., Theobald, A., Schenkel, R.: Query refinement by relevance feedback in an XML retrieval system. In: Atzeni, P., Chu, W., Lu, H., Zhou, S., Ling, T.-W. (eds.) ER 2004. LNCS, vol. 3288, pp. 854–855. Springer, Heidelberg (2004)
Porter, M., Galpin, V.: Relevance fedback in a public access catalogue for a research library: Muscat at the Scott Polar Research Institute. Program 22(1), 1–20 (1988)
Ramirez, G., Westerveld, T., de Vries, A.: Structural features in content oriented xml retrieval. Technical Report INS-E0508, CWI, Centre for Mathematics and Computer Science (2005)
Ramírez, G., Westerveld, T., de Vries, A.P.: Structural features in content oriented XML retrieval. In: CIKM 2005 (2005)
Robertson, S.: On term selection for query expansion. Journal of Documentation 46, 359–364 (1990)
Robertson, S., Sparck-Jones, K.: Relevance weighting of search terms. Journal of the American Society of Information Science 27, 129–146 (1976)
Rocchio Jr., J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The SMART Retrieval System: Experiments in Automatic Document Processing, ch. 14, pp. 313–323. Prentice Hall, Englewood Cliffs (1971)
Ruthven, I., Lalmas, M.: A survey on the use of relevance feedback for information access systems. Knowledge Engineering Review 18(1) (2003)
Sigurbjörnsson, B., Kamps, J., de Rijke, M.: The University of Amsterdam at INEX 2004. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 104–109. Springer, Heidelberg (2005)
Theobald, M., Schenkel, R., Weikum, G.: An efficient and versatile query engine for TopX search. In: VLDB 2005, pp. 625–636 (2005)
Theobald, M., Schenkel, R., Weikum, G.: TopX & XXL at INEX 2005. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds.) INEX 2005. LNCS, vol. 3977, pp. 282–295. Springer, Heidelberg (2006)
Trotman, A., Sigurbjörnsson, B.: Narrowed Extended XPath I, NEXI (2004), Available at http://www.cs.otago.ac.nz/postgrads/andrew/2004-4.pdf
van Rijsbergen, C., Harper, D., Porter, M.: The selection of good search terms. Information Processing and Management 17(2), 77–91 (1981)
Vittaut, J.-N., Piwowarski, B., Gallinari, P.: An algebra for structured queries in bayesian networks. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds.) INEX 2004. LNCS, vol. 3493, pp. 58–64. Springer, Heidelberg (2005)
Weber, R.: Using relevance feedback in XML retrieval. In: Blanken, et al. (eds.) [3], pp. 133–143
Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest LCAs in XML databases. In: SIGMOD 2005, pp. 537–538 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Schenkel, R., Theobald, M. (2006). Structural Feedback for Keyword-Based XML Retrieval. In: Lalmas, M., MacFarlane, A., Rüger, S., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds) Advances in Information Retrieval. ECIR 2006. Lecture Notes in Computer Science, vol 3936. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11735106_29
Download citation
DOI: https://doi.org/10.1007/11735106_29
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33347-0
Online ISBN: 978-3-540-33348-7
eBook Packages: Computer ScienceComputer Science (R0)