Abstract
One source of partial information in databases is the need to combine information from several databases. Even if each database is complete for some “world”, the combined databases will not be, and answers to queries against such combined databases can only be approximated. In this paper we describe various situations in which a precise answer cannot be obtained for a query asked against multiple databases. Based on an analysis of these situations, we propose a classification of constructs that can be used to model approximations.
A major goal is to obtain universality properties for these models of approximations. Universality properties suggest syntax for languages with approximations based on the operations which are naturally associated with them. We prove universality properties for most of the approximation constructs. Then we use them to design languages built around datatypes given by the approximation constructs. A straightforward approach results in languages that have a number of limitations. In an attempt to overcome those limitations, we explain how all the languages can be embedded into a language for conjunctive and disjunctive sets from [17], and demonstrate its usefulness in querying independent databases.
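As an illustration of the situation the abstract describes, the following minimal sketch shows one way an answer to a query against two independent databases might be bounded rather than computed exactly. It is not taken from the paper: the `Approximation` class, the `approximate` helper, and the sample data are all hypothetical, and the sketch assumes that one database yields only answers known to be correct while the other yields every answer that cannot be ruled out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Approximation:
    """Hypothetical pair of bounds on an unknown query answer."""
    lower: frozenset  # answers known to belong to the result
    upper: frozenset  # answers that cannot be ruled out

    def is_consistent(self) -> bool:
        # Every certain answer must also be a possible answer.
        return self.lower <= self.upper

    def is_exact(self) -> bool:
        # The answer is fully determined only when the bounds coincide.
        return self.lower == self.upper

    def undetermined(self) -> frozenset:
        # Candidates that neither database can confirm or exclude.
        return self.upper - self.lower


def approximate(certain_source, possible_source) -> Approximation:
    """Combine two sources into lower and upper bounds (illustrative only)."""
    return Approximation(frozenset(certain_source), frozenset(possible_source))


# Assumed sample data: every teaching assistant is an employee, and
# everyone on the course staff list is a teaching assistant.
employees = {"Ann", "Bob", "Cho", "Dev"}
course_staff = {"Ann", "Cho"}

# Query: "who are the teaching assistants?" Neither database answers it
# exactly, so the result is a pair of bounds.
tas = approximate(course_staff, employees)
```

Here `tas.lower` holds the names confirmed by the course staff list, `tas.upper` holds all employees, and `tas.undetermined()` is the set of names on which the two databases together give no verdict.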
Acknowledgements. I would like to thank Peter Buneman, Tim Griffin, Carl Gunter, Elsa Gunter, Paris Kanellakis, Hermann Puhlmann, Anna Romanowska and especially Achim Jung for their help.
References
[1] J. Biskup. A formal approach to null values in database relations. In: “Advances in Data Base Theory”, Volume 1, Plenum Press, New York, 1981.
[2] V. Breazu-Tannen, P. Buneman, and L. Wong. Naturally embedded query languages. In LNCS 646: Proc. ICDT-92, pages 140–154, Springer-Verlag.
[3] P. Buneman, S. Davidson, A. Watters. A semantics for complex objects and approximate answers. JCSS 43(1991), 170–218.
[4] P. Buneman, A. Jung, A. Ohori. Using powerdomains to generalize relational databases. Theoretical Computer Science 91(1991), 23–55.
[5] L. Cardelli. Types for data-oriented languages. In LNCS 303: Proc. EDBT-88.
[6] S. Chaudhuri. Finding nonrecursive envelopes for database predicates. In PODS-93, pages 135–146.
[7] S. Chaudhuri, Ph. Kolaitis. Can Datalog be approximated? In PODS-94, pages 86–96.
[8] L. Colby. A recursive algebra for nested relations. Inf. Syst. 15 (1990), 567–582.
[9] G. Grahne. “The Problem of Incomplete Information in Relational Databases”. Springer, Berlin, 1991.
[10] C. Gunter. The mixed powerdomain. Theoretical Computer Science 103 (1992), 311–334.
[11] C. Gunter. “Semantics of Programming Languages”. The MIT Press, 1992.
[12] E. Gunter and L. Libkin. OR-SML: A functional database programming language for disjunctive information and its applications. LNCS 856: Proc. DEXA-94, pages 641–650.
[13] T. Imielinski and W. Lipski. Incomplete information in relational databases. J. ACM 31(1984), 761–791.
[14] T. Imielinski, S. Naqvi, and K. Vadaparty. Incomplete objects — a data model for design and planning applications. In Proc. SIGMOD-91, pages 288–297.
[15] L. Libkin. A relational algebra for complex objects based on partial information. In LNCS 495: Proc. MFDBS-91, pages 36–41, Springer-Verlag.
[16] L. Libkin. “Aspects of Partial Information in Databases”. PhD Thesis, University of Pennsylvania, 1994.
[17] L. Libkin and L. Wong. Semantic representations and query languages for or-sets. In PODS-93, pages 37–48.
[18] W. Lipski. On semantic issues connected with incomplete information in databases. ACM Trans. Database Systems 4 (1979), 262–296.
[19] W. Lipski. On databases with incomplete information. J. ACM 28 (1981), 41–70.
[20] T.-H. Ngair. “Convex Spaces as an Order-theoretic Basis for Problem Solving”. Technical Report MS-CIS-92-60, University of Pennsylvania, 1992.
[21] H. Puhlmann. The snack powerdomain for database semantics. In LNCS 711: Proc. MFCS-93, Springer Verlag, 1993, pages 650–659.
[22] A. Romanowska and J.D.H. Smith. “Modal Theory: An Algebraic Approach to Order, Geometry and Convexity”. Heldermann Verlag, Berlin, 1985.
[23] V. Tannen. Tutorial: Languages for collection types. In PODS-94, pages 150–154.
[24] C. Zaniolo. Database relations with null values. JCSS 28 (1984), 142–166.
Copyright information
© 1995 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Libkin, L. (1995). Approximation in databases. In: Gottlob, G., Vardi, M.Y. (eds) Database Theory — ICDT '95. ICDT 1995. Lecture Notes in Computer Science, vol 893. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-58907-4_31
DOI: https://doi.org/10.1007/3-540-58907-4_31
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-58907-5
Online ISBN: 978-3-540-49136-1
eBook Packages: Springer Book Archive