Abstract
A model of random databases is given in which the data of one individual may be arbitrarily correlated; the correlations are described by a joint distribution function. The individuals are chosen independently, and their number \(m\) is considered to be (approximately) known. The probability that a given functional dependency \(A \rightarrow b\) holds (where \(A\) is a set of attributes and \(b\) is an attribute) is determined in a limiting sense. This probability is small if \(m\) is much larger than \(2^{H_2(A\rightarrow b)/2}\) and large if \(m\) is much smaller than \(2^{H_2(A\rightarrow b)/2}\), where \(H_2(A\rightarrow b)\) is an entropy-like functional of the probability distribution of the data.
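The threshold behaviour can be illustrated with a small simulation. The abstract does not define \(H_2(A\rightarrow b)\), so the sketch below assumes a birthday-type reading: \(H_2(A\rightarrow b) = -\log_2 q\), where \(q\) is the probability that two independently drawn rows agree on \(A\) but differ on \(b\). That definition, the function names, and the toy distribution are all assumptions made for illustration and are not taken from the chapter.

```python
import math
import random
from collections import defaultdict


def pair_violation_prob(joint):
    """Probability that two independent rows agree on A but differ on b.

    `joint` maps (a, b) pairs to probabilities (hypothetical input format).
    """
    p_a = defaultdict(float)
    for (a, _b), p in joint.items():
        p_a[a] += p
    same_a = sum(p * p for p in p_a.values())     # rows agree on A
    same_ab = sum(p * p for p in joint.values())  # rows agree on A and b
    return same_a - same_ab


def h2(joint):
    """ASSUMED definition: H_2(A -> b) = -log2(q); requires q > 0."""
    return -math.log2(pair_violation_prob(joint))


def fd_holds(rows):
    """True if no two rows share an A-value while differing on b."""
    seen = {}
    for a, b in rows:
        if seen.setdefault(a, b) != b:
            return False
    return True


def estimate_fd_probability(joint, m, trials=200, seed=0):
    """Monte Carlo estimate of P(A -> b holds) for m independent rows."""
    rng = random.Random(seed)
    keys = list(joint)
    weights = [joint[k] for k in keys]
    hits = 0
    for _ in range(trials):
        rows = rng.choices(keys, weights=weights, k=m)
        hits += fd_holds(rows)
    return hits / trials


def example_joint(n_a=400, eps=0.01):
    """Toy distribution: A uniform on n_a values, b = 1 with probability eps."""
    joint = {}
    for a in range(n_a):
        joint[(a, 0)] = (1 - eps) / n_a
        joint[(a, 1)] = eps / n_a
    return joint


if __name__ == "__main__":
    joint = example_joint()
    threshold = 2 ** (h2(joint) / 2)
    print("threshold 2^(H_2/2):", round(threshold, 1))
    for m in (10, int(threshold), 2000):
        print(m, estimate_fd_probability(joint, m))
```

Under this assumed definition the expected number of violating row pairs is about \(\binom{m}{2} q\), which is why the transition sits near \(m \approx q^{-1/2} = 2^{H_2(A\rightarrow b)/2}\). The simulation only illustrates that heuristic and does not reproduce the limiting statement established in the chapter.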
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Katona, G.O.H. (2012). Random Databases with Correlated Data. In: Düsterhöft, A., Klettke, M., Schewe, K.-D. (eds) Conceptual Modelling and Its Theoretical Foundations. Lecture Notes in Computer Science, vol 7260. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-28279-9_4
DOI: https://doi.org/10.1007/978-3-642-28279-9_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-28278-2
Online ISBN: 978-3-642-28279-9
eBook Packages: Computer Science (R0)