Abstract
Documents often display a hierarchical structure. For example, a SGML document contains a title, several sections, which themselves contain paragraphs. In this paper, we develop a formal model to represent in a uniform manner structured documents by their content and structure. As a result, querying structured documents can be done with respect to their content, their structure, or both. The model is based on a possible worlds approach, modal operators and uncertainty distributions.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
S.H. Myaeng, D. H. Jang, M. S. Kim, and Z. C. Zhoo. A flexible model for retrieval of SGML documents. In Proceedings of ACM-SIGIR Conference on Research and Development in Information Retrieval, pages 138–145, Melbourne, Australia, 1998.
S. Abiteboul, S. Cluet, V. Christophides, T. Milo, G. Moerkotte, and J. Simeon. Querying documents in object databases. International Journal on Digital Libraries, 1:1–9, 1997.
R. Wilkinson. Effective retrieval of structured documents. In Proceedings of ACM-SIGIR Conference on Research and Development in Information Retrieval, pages 311–317, Dublin, Ireland, 1994.
Y. Chiaramella and A. Kheirbek. An integrated model for hypermedia and information retrieval. Information Retrieval and Hypertext, 1996.
G. Salton, J. Allan, and C. Buckley. Approaches to passage retrieval in full text information systems. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, pages 49–58, Pittsburgh, USA, 1993.
T. Roelleke and N. Fuhr. Retrieval of complex objects using a four-valued logic. In Proceedings of ACM-SIGIR Conference on Research and Development in Information Retrieval, pages 206–214, Zurich, Switzerland, 1996.
T. Roelleke. POOL: Probabilistic Object-Oriented Logical Representation and Retrieval of Complex Objects - A Model for Hypermedia Retrieval. Shaker Verlag, Aachen, 1999. Phd Thesis.
M. Lalmas and J. Ruthven. Representing and retrieving structured documents with Dempster-Shafer’s theory of evidence: Modelling and evaluation. Journal of Documentation, 54(5):529–565, 1998.
M.E. Frisse. Searching for information in a hypertext medical handbook. Communications of the ACM, 31(7):880–886, 1988.
Y. Chiaramella, P. Mulhem, and F. Fourel. A model for multimedia information retrieval. Technical Report Fermi ESPRIT BRA 8134, University of Glasgow, 1996.
M. Lalmas. Dempster-Shafer’s theory of evidence applied to structured documents: modelling uncertainty. In Proceedings of ACM SIGIR Conference on Research and Development in Information Retrieval, pages 110–118, Philadelphia, PA, USA, 1997.
G. Shafer. A Mathematical Theory of Evidence. Princeton University Press, 1976.
E. H. Ruspini. The logical foundations of evidential reasoning. Technical Report 408, SRI International, 1986.
C. J. van Rijsbergen. Information Retrieval. Butterworths, London, 2 edition, 1979.
R. Baeza-Yates and B. Ribeiro-Neto. Modern Information Retrieval. Addison Wesley, 1999.
M. Lalmas and E. Moutogianni. A Dempster-Shafer indexing for the focussed retrieval of hierarchically structured documents: Implememtation and experiments on a web museum collection. In RIAO, 2000.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag London
About this paper
Cite this paper
Lalmas, M. (2001). Uniform Representation of Content and Structure for structured document retrieval. In: Bramer, M., Preece, A., Coenen, F. (eds) Research and Development in Intelligent Systems XVII. Springer, London. https://doi.org/10.1007/978-1-4471-0269-4_16
Download citation
DOI: https://doi.org/10.1007/978-1-4471-0269-4_16
Publisher Name: Springer, London
Print ISBN: 978-1-85233-403-1
Online ISBN: 978-1-4471-0269-4
eBook Packages: Springer Book Archive