Encyclopedia of Database Systems

2018 Edition
Editors: Ling Liu, M. Tamer Özsu

Aggregation-Based Structured Text Retrieval

  • Theodora Tsikrika
Reference work entry
DOI: https://doi.org/10.1007/978-1-4614-8265-9_14

Definition

Text retrieval is concerned with the retrieval of documents in response to user queries. This is achieved by (i) representing documents and queries with indexing features that provide a characterisation of their information content, and (ii) defining a function that uses these representations to perform retrieval. Structured text retrieval introduces a finer-grained retrieval paradigm that supports the representation and subsequent retrieval of the individual document components defined by the document’s logical structure. Aggregation-based structured text retrieval defines (i) the representation of each document component as the aggregation of the representation of its own information content and the representations of information content of its structurally related components, and (ii) retrieval of document components based on these (aggregated) representations.
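The definition above can be illustrated with a minimal sketch. It is not the method of any particular system covered by this entry. It assumes that a component's own representation is a bag of term frequencies, that aggregation adds each child's aggregated representation scaled by a fixed weight, and that retrieval sums the aggregated weights of the query terms. The names `Component`, `aggregate`, `score`, `rank` and the parameter `child_weight` are hypothetical and chosen only for illustration.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class Component:
    """A node in a document's logical structure (hypothetical type).

    Assumption: component names are unique within one document.
    """
    name: str
    text: str = ""
    children: list = field(default_factory=list)


def own_representation(component):
    """Term frequencies of the component's own text only."""
    return Counter(component.text.lower().split())


def aggregate(component, child_weight=0.5, out=None):
    """Build aggregated representations bottom-up.

    Each component's representation is its own term frequencies plus
    child_weight times the aggregated representation of each child.
    The fixed child_weight is an illustrative assumption.
    Returns a dict mapping component name -> {term: weight}.
    """
    if out is None:
        out = {}
    rep = {term: float(count) for term, count in own_representation(component).items()}
    for child in component.children:
        aggregate(child, child_weight, out)
        for term, weight in out[child.name].items():
            rep[term] = rep.get(term, 0.0) + child_weight * weight
    out[component.name] = rep
    return out


def score(rep, query):
    """Assumed retrieval function: sum of aggregated weights of query terms."""
    return sum(rep.get(term, 0.0) for term in query.lower().split())


def rank(root, query, child_weight=0.5):
    """Rank every component of the document by its aggregated score."""
    reps = aggregate(root, child_weight)
    scored = [(name, score(rep, query)) for name, rep in reps.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    sec1 = Component("sec1", "xml retrieval")
    sec2 = Component("sec2", "retrieval models")
    article = Component("article", "structured documents", [sec1, sec2])
    for name, value in rank(article, "xml retrieval"):
        print(name, value)
```

In this toy document the section that contains both query terms ranks first, while the enclosing article still receives a non-zero score through the content it aggregates from its sections, even though its own text contains neither query term. Under these assumptions, that is the sense in which components are retrieved on the basis of aggregated representations.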

The aim of aggregation-based approaches is to improve retrieval effectiveness by capturing and exploiting the...

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. Center for Mathematics and Computer Science, Amsterdam, The Netherlands

Section editors and affiliations

  • Jaap Kamps (1)
  1. University of Amsterdam, Amsterdam, The Netherlands