Abstract
This paper describes the current state of our system for structured retrieval. The system itself is based on an extension of the vector space model initially proposed by Fox [5]. The basic functions are performed using the Smart experimental retrieval system [10]. The major advance in our system this year is the incorporation of a facility for the dynamic retrieval of elements, which we refer to as flexible retrieval. This approach allows the system to return a rank-ordered list of elements based on a single indexing of the collection at the paragraph level.Lnu term weights [12,13] are generated dynamically along with the elements themselves, thus eliminating the need for propagation. Experimental results using this technique on INEX 2006 data show that it can produce results competitive with those produced by retrieval on an all-element index of the collection (and in fact produces virtually identical results for the new Fetch-and-Browse task). Early relevance feedback results are also reported.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Crouch, C., Apte, S., Bapat, H.: Using the extended vector model for XML retrieval. In: Proc of the First Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), Schloss Dagstuhl, pp. 99–104 (2002)
Crouch, C., Apte, S., Bapat, H.: An approach to structured retrieval based on the extended vector model. In: Proc. of the Second Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), Schloss Dagstuhl, pp. 87–93 (2003)
Crouch, C., Mahajan, A., Bellamkonda, A.: Flexible retrieval based on the vector space model. In: Advances in XML Information Retrieval, Third International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), Schloss Dagstuhl, pp. 292–302 (2005)
Doddapaneni, N.: Effective structured query processing. M.S. Thesis, Department of Computer Science, University of Minnesota Duluth, Duluth, MN (2005), http://www.d.umn.edu/cs/thesis/doddapaneni.pdf
Fox, E.A.: Extending the Boolean and vector space models of information retrieval with p-norm queries and multiple concept types. Ph.D. Dissertation, Department of Computer Science, Cornell University (1983)
Kamps, J., de Rijke, M., Sigurbjornsson, B.: Length normalization in XML retrieval. In: Proc of the 27th Annual International ACM SIGIR Conference, Sheffield, England, pp. 80–87 (2004)
Khanna, S.: Design and implementation of a flexible retrieval system. M.S. Thesis, Department of Computer Science, University of Minnesota Duluth, Duluth, MN (2005), http://www.d.umn.edu/cs/thesis/khanna.pdf
Potnis, P.: Relevance feedback in a flexible retrieval environment. M.S.Thesis, Department of Computer Science, University of Minnesota Duluth, Duluth, MN (2005), http://www.d.umn.edu/cs/thesis/potnis.pdf
Rocchio, J.: Relevance feedback in information retrieval. In: Salton, G. (ed.) The Smart System—Experiments in Automatic Document Processing, pp. 313–323. Prentice-Hall, Englewood Cliffs (1971)
Salton, G. (ed.): The Smart Rretrieval System—Experiments in Automatic Document Processing. Prentice-Hall, Englewood Cliffs (1971)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Comm. ACM 18, 11, 613–620 (1975)
Singhal, A.: AT&T at TREC-6. In: The Sixth Text REtrieval Conf. (TREC-6), NIST SP 500-240, 215–225 (1998)
Singhal, A., Buckley, C., Mitra, M.: Pivoted document length normalization. In: Proc. of the 19th Annual International ACM SIGIR Conference, Zurich, pp. 21–29 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Crouch, C.J., Khanna, S., Potnis, P., Doddapaneni, N. (2006). The Dynamic Retrieval of XML Elements. In: Fuhr, N., Lalmas, M., Malik, S., Kazai, G. (eds) Advances in XML Information Retrieval and Evaluation. INEX 2005. Lecture Notes in Computer Science, vol 3977. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-34963-1_20
Download citation
DOI: https://doi.org/10.1007/978-3-540-34963-1_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34962-4
Online ISBN: 978-3-540-34963-1
eBook Packages: Computer ScienceComputer Science (R0)