In structured text retrieval, the structure of a text component may be used to estimate the relevance of that component. This is done by associating a weight with the structure, reflecting its significance when estimating the relevance of the component for a given query.
Associating a weight with the structure of a component is not in itself new, and several investigations have been reported for whole-document retrieval. This entry is concerned with structure weights in the context of structured text retrieval, where the aim is to exploit the document structure to return document components instead of whole documents.
In structured text retrieval, not all document components will trigger the same user satisfaction when returned as answers to queries. In the context of structured documents marked up in XML, some document components, i.e., XML elements, may not be appropriate to return because they are too small, or because they are of a tag type that does not contain informative content,...
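As an illustration of the idea, the following minimal sketch combines a content-based score with a structure weight per XML element. Everything specific in it is an assumption made for the example and is not taken from this entry: the tag names and weight values in `TAG_WEIGHTS`, the `MIN_LENGTH` cut-off for elements that are too small, the simple term-overlap `content_score`, and the choice to combine the two scores by multiplication.

```python
from dataclasses import dataclass

# Hypothetical structure weights per tag type (illustrative values only).
TAG_WEIGHTS = {"article": 1.0, "sec": 0.9, "p": 0.7, "title": 0.3, "b": 0.05}
DEFAULT_WEIGHT = 0.5  # assumed weight for tag types not listed above
MIN_LENGTH = 5        # assumed cut-off: shorter elements are treated as too small


@dataclass
class Element:
    path: str  # location of the element in the document, e.g. an XPath
    tag: str   # tag type of the element
    text: str  # textual content of the element


def content_score(text: str, query_terms: set[str]) -> float:
    """Fraction of the element's tokens that are query terms (a toy score)."""
    tokens = text.lower().split()
    if not tokens:
        return 0.0
    hits = sum(1 for token in tokens if token in query_terms)
    return hits / len(tokens)


def structure_weight(element: Element) -> float:
    """Weight reflecting how appropriate the element is to return."""
    if len(element.text.split()) < MIN_LENGTH:
        return 0.0  # too small to be a useful answer
    return TAG_WEIGHTS.get(element.tag, DEFAULT_WEIGHT)


def rank(elements: list[Element], query: str) -> list[tuple[float, str]]:
    """Return (score, path) pairs with a positive score, best first."""
    terms = set(query.lower().split())
    scored = [
        (structure_weight(e) * content_score(e.text, terms), e.path)
        for e in elements
    ]
    return sorted((pair for pair in scored if pair[0] > 0), reverse=True)
```

Calling `rank(elements, "xml retrieval")` on a list of `Element` objects returns the elements ordered by the combined score, with elements below the length cut-off left out. In practice the weights would be estimated from data rather than fixed by hand, and the combination need not be multiplicative.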