Synonyms
Content-oriented XML retrieval; Focused retrieval; Structured document retrieval; Structured text retrieval
Definition
Text documents often contain a mixture of structured and unstructured content. One way to format this mixed content is according to the adopted W3C standard for information repositories and exchanges, the eXtensible Mark-up Language (XML). In contrast to HTML, which is mainly layout-oriented, XML follows the fundamental concept of separating the logical structure of a document from its layout. This logical document structure can be exploited to allow a more focused sub-document retrieval.
XML retrieval breaks away from the traditional retrieval unit of a document as a single large (text) block and aims to implement focused retrievalstrategies aiming at returning document components, i.e., XML elements, instead of whole documents in response to a user query. This focused retrieval strategy is believed to be of particular benefit for information repositories...
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Recommended Reading
Amer-Yahia S, Lalmas M. XML search: languages, INEX and scoring. ACM SIGMOD Rec. 2006;35(4):16–23.
Baeza-Yates R, Fuhr N, Maarek YS, editors. Special issue on XML retrieval. ACM Trans Inf Syst 2006;24(4).
Blanken HM, Grabs T, Schek H-J, Schenkel R, Weikum G, editors. Intelligent search on XML data, applications, languages, models, implementations, and benchmarks. Berlin: Springer; 2003.
Denoyer L, Gallinari P. The Wikipedia XML corpus, comparative evaluation of XML information retrieval systems. In: Proceedings of the 5th International Workshop of the Initiative for the Evaluation of XML Retrieval; 2007. p. 12–19.
Fuhr N, Lalmas M, editors. Special issue on INEX. Inf. Retr. 2005;8(4).
Kamps J, de Rijke M, Sigurbjörnsson B. The importance of length normalization for XML retrieval. Inf Retr. 2005;8(4):631–54.
Kazai G, Gövert N, Lalmas M, Fuhr N. The INEX evaluation initiative. In: Blanken HM, Grabs T, Schek H, Schenkel R, Weikum G, editors. Intelligent search on XML data, applications, languages, models, implementations, and benchmarks. Springer; 2003. p. 279–93.
Kazai G, Lalmas M, Reid J. Construction of a test collection for the focused retrieval of structured documents. In: Proceedings of the 25th European Conference on IR Research; 2003. p. 88–103.
Lalmas M, Tombros A. INEX 2002–2006: Understanding XML retrieval evaluation. In: Proceedings of the 1st International DELOS Conference; Pisa; 2007. p. 187–96.
Mass Y, Mandelbrod M. Component ranking and automatic query refinement for XML retrieval. In: Proceedings of 3rd International Workshop of the Initiative for the Evaluation of XML Retrieval; 2004. p. 73–84.
Pharo N, Trotman A. The use case track at INEX 2006. SIGIR Forum. 2007;41(1):64–6.
van Zwol R, Baas J, van Oostendorp H, Wiering F. Bricks: the building blocks to tackle query formulation in structured document retrieval. In: Proceedings of the 28th European Conference on IR Research; 2006. p. 314–25.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Section Editor information
Rights and permissions
Copyright information
© 2018 Springer Science+Business Media, LLC, part of Springer Nature
About this entry
Cite this entry
Lalmas, M., Trotman, A. (2018). XML Retrieval. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_474
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8265-9_474
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8266-6
Online ISBN: 978-1-4614-8265-9
eBook Packages: Computer ScienceReference Module Computer Science and Engineering