Approximation and Scoring for XML Data Management

Buratti, Giacomo; Montesi, Danilo

doi:10.1007/978-3-540-88655-6_18


Conference paper

Part of the book series: Communications in Computer and Information Science (CCIS, volume 22)

Abstract

XQuery Full-Text is the proposed standard language for querying XML documents using either standard or full-text conditions. While full-text conditions can have either Boolean or ranked semantics, standard conditions must be satisfied for an element to be returned. This paper proposes a more general formal model that treats structural, value-based and full-text conditions as desiderata rather than mandatory constraints. This goal is achieved by defining a set of relaxation operators that, given a path expression or a selection condition, return a set of relaxed path expressions or selection conditions. Approximated algebraic operators are defined to represent typical queries; they return elements that fully satisfy the conditions, as well as elements that answer a relaxed version of the original query. Each result is assigned a score reflecting how well it satisfies the original query.
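
To make the idea of a relaxation operator concrete, the following is a minimal sketch and not the operators defined in the paper. It assumes a single hypothetical structural relaxation (replacing one child axis `/` with a descendant axis `//` in a simple path expression) and a hypothetical multiplicative penalty of 0.8 per relaxation step. The function names, the path syntax subset and the penalty value are all illustrative assumptions.

```python
from typing import Dict, List, Set, Tuple

# A step is (axis, name); axis is "/" (child) or "//" (descendant).
Step = Tuple[str, str]


def parse_path(path: str) -> List[Step]:
    """Split a simple path such as '/book//title' into (axis, name) steps."""
    steps: List[Step] = []
    i = 0
    while i < len(path):
        if not path.startswith("/", i):
            raise ValueError(f"expected '/' at position {i} in {path!r}")
        axis = "//" if path.startswith("//", i) else "/"
        i += len(axis)
        j = path.find("/", i)
        if j == -1:
            j = len(path)
        name = path[i:j]
        if not name:
            raise ValueError(f"empty step name in {path!r}")
        steps.append((axis, name))
        i = j
    return steps


def format_path(steps: List[Step]) -> str:
    """Inverse of parse_path."""
    return "".join(axis + name for axis, name in steps)


def relax_axis(path: str) -> Set[str]:
    """Hypothetical relaxation operator: return every path obtained by
    turning exactly one child axis into a descendant axis."""
    steps = parse_path(path)
    relaxed: Set[str] = set()
    for k, (axis, name) in enumerate(steps):
        if axis == "/":
            relaxed.add(format_path(steps[:k] + [("//", name)] + steps[k + 1:]))
    return relaxed


def relaxations_with_scores(path: str, penalty: float = 0.8) -> Dict[str, float]:
    """Breadth-first closure of relax_axis. Each path is scored as
    penalty ** (number of relaxation steps from the original path),
    so the original path keeps score 1.0 (assumed scoring rule)."""
    scores: Dict[str, float] = {path: 1.0}
    frontier = [path]
    while frontier:
        next_frontier: List[str] = []
        for p in frontier:
            for q in relax_axis(p):
                if q not in scores:
                    scores[q] = scores[p] * penalty
                    next_frontier.append(q)
        frontier = next_frontier
    return scores


if __name__ == "__main__":
    ranked = sorted(relaxations_with_scores("/book/title").items(),
                    key=lambda kv: -kv[1])
    for relaxed_path, score in ranked:
        print(f"{score:.2f}  {relaxed_path}")
```

Under these assumptions, an element matched only by a more relaxed path would inherit that path's lower score, which mirrors the abstract's notion of a score reflecting how well the original query is satisfied. Value-based and full-text relaxations would need their own operators and are not covered by this sketch.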

Author information

Authors and Affiliations

Department of Mathematics and Computer Science, University of Camerino, Via Madonna delle Carceri 9, Camerino, Italy
Giacomo Buratti
Department of Computer Science, University of Bologna, Mura Anteo Zamboni 7, Bologna, Italy
Danilo Montesi

Editor information

Editors and Affiliations

Polytechnic Institute of Setúbal – INSTICC, Av. D. Manuel I, 27A - 2. Esq., 2910-595, Setúbal, Portugal
Joaquim Filipe
Interdisciplinary Institute for Collaboration and Research on Enterprise Systems and Technology – IICREST, P.O. Box 104, 1618, Sofia, Bulgaria
Boris Shishkov
School of Computing, Dublin City University, Dublin 9, Ireland
Markus Helfert
Department of Computing, Macquarie University, NSW 2109, Sydney, Australia
Leszek A. Maciaszek

About this paper

Cite this paper

Buratti, G., Montesi, D. (2008). Approximation and Scoring for XML Data Management. In: Filipe, J., Shishkov, B., Helfert, M., Maciaszek, L.A. (eds) Software and Data Technologies. ICSOFT ENASE 2007. Communications in Computer and Information Science, vol 22. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88655-6_18

DOI: https://doi.org/10.1007/978-3-540-88655-6_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88654-9
Online ISBN: 978-3-540-88655-6
eBook Packages: Computer Science, Computer Science (R0)