Skip to main content

Uniform Representation of Content and Structure for structured document retrieval

  • Conference paper

Abstract

Documents often display a hierarchical structure. For example, a SGML document contains a title, several sections, which themselves contain paragraphs. In this paper, we develop a formal model to represent in a uniform manner structured documents by their content and structure. As a result, querying structured documents can be done with respect to their content, their structure, or both. The model is based on a possible worlds approach, modal operators and uncertainty distributions.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. S.H. Myaeng, D. H. Jang, M. S. Kim, and Z. C. Zhoo. A flexible model for retrieval of SGML documents. In Proceedings of ACM-SIGIR Conference on Research and Development in Information Retrieval, pages 138–145, Melbourne, Australia, 1998.

    Google Scholar 

  2. S. Abiteboul, S. Cluet, V. Christophides, T. Milo, G. Moerkotte, and J. Simeon. Querying documents in object databases. International Journal on Digital Libraries, 1:1–9, 1997.

    Article  Google Scholar 

  3. R. Wilkinson. Effective retrieval of structured documents. In Proceedings of ACM-SIGIR Conference on Research and Development in Information Retrieval, pages 311–317, Dublin, Ireland, 1994.

    Google Scholar 

  4. Y. Chiaramella and A. Kheirbek. An integrated model for hypermedia and information retrieval. Information Retrieval and Hypertext, 1996.

    Google Scholar 

  5. G. Salton, J. Allan, and C. Buckley. Approaches to passage retrieval in full text information systems. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, pages 49–58, Pittsburgh, USA, 1993.

    Google Scholar 

  6. T. Roelleke and N. Fuhr. Retrieval of complex objects using a four-valued logic. In Proceedings of ACM-SIGIR Conference on Research and Development in Information Retrieval, pages 206–214, Zurich, Switzerland, 1996.

    Google Scholar 

  7. T. Roelleke. POOL: Probabilistic Object-Oriented Logical Representation and Retrieval of Complex Objects - A Model for Hypermedia Retrieval. Shaker Verlag, Aachen, 1999. Phd Thesis.

    Google Scholar 

  8. M. Lalmas and J. Ruthven. Representing and retrieving structured documents with Dempster-Shafer’s theory of evidence: Modelling and evaluation. Journal of Documentation, 54(5):529–565, 1998.

    Article  Google Scholar 

  9. M.E. Frisse. Searching for information in a hypertext medical handbook. Communications of the ACM, 31(7):880–886, 1988.

    Article  Google Scholar 

  10. Y. Chiaramella, P. Mulhem, and F. Fourel. A model for multimedia information retrieval. Technical Report Fermi ESPRIT BRA 8134, University of Glasgow, 1996.

    Google Scholar 

  11. M. Lalmas. Dempster-Shafer’s theory of evidence applied to structured documents: modelling uncertainty. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, pages 110–118, Philadelphia, PA, USA, 1997.

    Google Scholar 

  12. G. Shafer. A Mathematical Theory of Evidence. Princeton University Press, 1976.

    Google Scholar 

  13. E. H. Ruspini. The logical foundations of evidential reasoning. Technical Report 408, SRI International, 1986.

    Google Scholar 

  14. C. J. van Rijsbergen. Information Retrieval. Butterworths, London, 2 edition, 1979.

    Google Scholar 

  15. R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison Wesley, 1999.

    Google Scholar 

  16. M. Lalmas and E. Moutogianni. A Dempster-Shafer indexing for the focussed retrieval of hierarchically structured documents: Implememtation and experiments on a web museum collection. In RIAO, 2000.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2001 Springer-Verlag London

About this paper

Cite this paper

Lalmas, M. (2001). Uniform Representation of Content and Structure for structured document retrieval. In: Bramer, M., Preece, A., Coenen, F. (eds) Research and Development in Intelligent Systems XVII. Springer, London. https://doi.org/10.1007/978-1-4471-0269-4_16

Download citation

  • DOI: https://doi.org/10.1007/978-1-4471-0269-4_16

  • Publisher Name: Springer, London

  • Print ISBN: 978-1-85233-403-1

  • Online ISBN: 978-1-4471-0269-4

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics