Structured Document Retrieval
Synonyms: Focused retrieval; Passage retrieval; Querying semi-structured data; Structured text retrieval; XML retrieval
Structured document retrieval is concerned with the retrieval of document fragments rather than whole documents. The structure of the document, whether explicitly provided by a mark-up language or derived, is exploited to determine the most relevant document fragments to return as answers to a given query. The fragments identified in this way can in turn be used to determine the most relevant documents to return for that query.
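The two steps in this definition, ranking fragments and then ranking documents from their fragments, can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and not part of the entry: the toy scoring function (query-term occurrences divided by fragment length), the choice of the best fragment score as the document score, the sample documents, and all function names. Real systems use far more elaborate retrieval models.

```python
import re
import xml.etree.ElementTree as ET
from collections import Counter


def tokenize(text):
    """Lowercase a string and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query_terms, text):
    """Toy relevance score (an assumption): query-term occurrences / fragment length."""
    tokens = tokenize(text)
    if not tokens:
        return 0.0
    counts = Counter(tokens)
    return sum(counts[t] for t in query_terms) / len(tokens)


def rank_fragments(docs, query):
    """Score every element of every document; return (score, doc_id, path), best first."""
    query_terms = set(tokenize(query))
    results = []
    for doc_id, xml_text in docs.items():
        root = ET.fromstring(xml_text)
        # Walk the tree; each path records the child's position among all siblings.
        stack = [(root, root.tag)]
        while stack:
            elem, path = stack.pop()
            text = " ".join(elem.itertext())  # text of the element and its descendants
            s = score(query_terms, text)
            if s > 0:
                results.append((s, doc_id, path))
            for i, child in enumerate(elem, start=1):
                stack.append((child, f"{path}/{child.tag}[{i}]"))
    results.sort(key=lambda r: (-r[0], r[1], r[2]))
    return results


def rank_documents(fragments):
    """Rank documents by their best fragment score (an assumed aggregation rule)."""
    best = {}
    for s, doc_id, _ in fragments:
        best[doc_id] = max(s, best.get(doc_id, 0.0))
    return sorted(best.items(), key=lambda kv: (-kv[1], kv[0]))


# Hypothetical two-document collection.
DOCS = {
    "d1": "<article><title>Databases</title>"
          "<sec>XML retrieval returns elements</sec>"
          "<sec>Indexing relational tables</sec></article>",
    "d2": "<article><title>Cooking</title><sec>Bread and soup</sec></article>",
}

fragments = rank_fragments(DOCS, "XML retrieval")
documents = rank_documents(fragments)
print(fragments)
print(documents)
```

In this sketch the section that mentions both query terms outranks the enclosing article, because the article's score is diluted by its unrelated text. This is the kind of effect that motivates returning fragments instead of whole documents.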
The aim of this entry is to clarify the different terms that have been used to refer to, or that are strongly related to, structured retrieval and semi-structured data.
The term “structured document retrieval,” which was introduced in the early to mid-1990s in the information retrieval community, refers to both “passage retrieval” and “structured text retrieval.” In passage retrieval, documents are first decomposed into passages...
2. Kazai G, Gövert N, Lalmas M, Fuhr N. The INEX evaluation initiative. In: Intelligent search on XML data: applications, languages, models, implementations, and benchmarks. New York: Springer; 2003. p. 279–93.