Abstract
This paper reports the result of experimentation of our approach using the vector space model for retrieving large-scale XML data. The purposes of the experiments are to improve retrieval precision on the INitiative for the Evaluation of XML Retrieval (INEX) 2008 Adhoc Track, and to compare the retrieval time of our system to other systems on the INEX 2008 Efficiency Track. For the INEX 2007 Adhoc Track, we developed a system using a relative inverted-path (RIP) list and a Bottom-UP approach. The system achieved reasonable retrieval time for XML data. However the system has a room for improvement in terms of retrieval precision. So for INEX 2008, the system uses CAS titles and Pseudo Relevance Feedback (PRF) to improve retrieval precision.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Extensible Markup Language (XML) 1.1, 2nd ed., http://www.w3.org/TR/xml11/
XML Path Language (XPath) Version 1.0., http://www.w3.org/TR/xpath
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Communications of the ACM 18, 613–620 (1975)
Kamps, J., Pehcevski, J., Kazai, G., Lalmas, M., Robertson, S.: INEX 2007 Evaluation Measures. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 24–33. Springer, Heidelberg (2008)
Montgomery, J., Luo, S.L., Callan, J., Evans, D.A.: Effect of varying number of documents in blind feedback: analysis of the 2003 NRRC RIA workshop ”bf_numdocs” experiment suite. In: SIGIR 2004: Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 476–477. ACM, New York (2004)
Tanioka, H.: A Fast Retrieval Algorithm for Large-Scale XML Data. In: Fuhr, N., Kamps, J., Lalmas, M., Trotman, A. (eds.) INEX 2007. LNCS, vol. 4862, pp. 129–137. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tanioka, H. (2009). Pseudo Relevance Feedback Using Fast XML Retrieval. In: Geva, S., Kamps, J., Trotman, A. (eds) Advances in Focused Retrieval. INEX 2008. Lecture Notes in Computer Science, vol 5631. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03761-0_22
Download citation
DOI: https://doi.org/10.1007/978-3-642-03761-0_22
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03760-3
Online ISBN: 978-3-642-03761-0
eBook Packages: Computer ScienceComputer Science (R0)