Sampling Techniques for Statistical Databases
A sampling technique is a method that inspects only a small portion of the data in a database in order to reduce the time needed to compute an aggregate query, while ensuring that the result computed on the sample faithfully represents the true result of the query over the entire data population.
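As a minimal illustration of this idea, the sketch below estimates SUM and AVERAGE from a simple random sample drawn without replacement. The function name `estimate_aggregates`, its parameters, and the in-memory list standing in for a database column are assumptions made for the example, not part of any particular database system.

```python
import random

def estimate_aggregates(values, sample_size, seed=0):
    """Estimate SUM and AVERAGE of `values` from a simple random sample.

    `values` is a hypothetical in-memory stand-in for a database column.
    """
    rng = random.Random(seed)
    # Inspect only `sample_size` records instead of the whole population.
    sample = rng.sample(values, sample_size)
    avg_est = sum(sample) / sample_size
    # Scale the sample mean by the population size to estimate the total.
    sum_est = avg_est * len(values)
    return sum_est, avg_est

if __name__ == "__main__":
    population = list(range(1, 10001))
    est_sum, est_avg = estimate_aggregates(population, 500)
    print("estimated SUM:", est_sum, "true SUM:", sum(population))
    print("estimated AVERAGE:", est_avg)
```

The estimate is exact only when the sample covers the whole population; for smaller samples its error shrinks as the sample size grows.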
Acceptance-rejection sampling (AR sampling) is one such sampling technique.
Sampling is used in a database for several reasons: (i) to estimate the results of aggregate queries (e.g., SUM, COUNT, or AVERAGE); (ii) to retrieve a sample of records from a database query for subsequent processing; (iii) for internal use by the query optimizer for selectivity estimation; and (iv) to provide privacy protection for records on individuals contained in statistical databases. It has been determined that fixed-size random sampling of data does not yield a true representation of the population. Acceptance/rejection (A/R) sampling is used to construct weighted samples in which the...
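A minimal sketch of the general acceptance/rejection idea follows. It assumes that each record carries a nonnegative weight and that a record should be selected with probability proportional to that weight; the function name `ar_weighted_sample` and its parameters are hypothetical and chosen only for illustration. A record is proposed uniformly at random and accepted with probability equal to its weight divided by the maximum weight; rejected proposals are discarded and the process repeats.

```python
import random

def ar_weighted_sample(records, weights, sample_size, seed=0):
    """Draw `sample_size` records (with replacement) by acceptance/rejection.

    Assumption: selection probability is proportional to each record's weight.
    """
    if len(records) != len(weights):
        raise ValueError("records and weights must have the same length")
    w_max = max(weights)
    if w_max <= 0:
        raise ValueError("at least one weight must be positive")
    rng = random.Random(seed)
    sample = []
    while len(sample) < sample_size:
        i = rng.randrange(len(records))          # propose a record uniformly
        if rng.random() < weights[i] / w_max:    # accept with prob w_i / w_max
            sample.append(records[i])
        # otherwise the proposal is rejected and a new one is drawn
    return sample

if __name__ == "__main__":
    picked = ar_weighted_sample(["a", "b", "c"], [0, 1, 3], 4000)
    print({r: picked.count(r) for r in "abc"})
```

The expected number of proposals per accepted record is the maximum weight divided by the mean weight, so this approach is efficient only when the weights are not too skewed.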