Abstract
The selectivity factor of relational operations is a critical parameter for determining the cost function of query processing. Good estimates of these parameters allow the optimizers to choose the least expensive path in the query execution. A method for estimating the join and projection selectivity factors based on the orthogonal polynomial series is presented. Experimental results on real data are also reported which show the good performance of the approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bell DA, Ling DHO, McClean S (1989) Pragmatic estimation of join sizes and attribute correlations. In: Proceedings of the IEEE 5th ICDE conference, Computer Society Press, Los Alamitos, pp 76–84
Chaudhuri S, Narasayya VR (2001) Automating statistics management for query optimizers. IEEE TKDE 13:7–20
Chaudhuri S, Das G, Datar M, Motwani R, Narasayya V (2001) Overcoming limitations of sampling for aggregation queries. In: Proceedings of the IEEE ICDE conference, Computer Society Press, Los Alamitos, pp 534–542
Chen CM, Roussopoulos N (1994) Adaptive selectivity estimation using query feedback. In: Proceedings of the ACM SIGMOD conference, pp 161–172
Ciaccia P, Maio D (1995) Domains and active domains: what this distinction implies for the estimation of projection sizes in relational databases. IEEE TKDE 4:641–654
Gardy D, Puech C (1984) On the sizes of projections a generating functions approach. Inf Syst 9:231–235
Gibbons PB (2001) Distinct sampling for highly-accurate answers to distinct values queries and event reports. In: Proceedings of the 27th VLDB conference, Roma, Italy, pp 541–550
Haas PJ, Naughton JF, Seshadri S, Stokes L (1995) Sampling-based estimation of the number of distinct values of an attribute. In: Proceedings of the 21th VLDB conference, Morgan Kaufmann Publishers, San Francisco, CA, pp 311–322
Ioannidis YE, Christodoulakis S (1993) Optimal histograms for limiting worst-case error propagation in the size of the join results. ACM TODS 18:709–748
Ioannidis YE, Poosala V (1995) Balancing histogram optimality and practicality for query result size estimation. In: Proceedings of the ACM SIGMOD conference, pp 233–244
Jiang Z, Luo C, Hou W-C, Yan F, Zhu Q (2006) Estimating aggregate join queries over data streams using discrete cosine transform. In: Proceedings of the DEXA conference, pp 182–192
Lefons E, Merico A, Tangorra F (1995) Analytical profile estimation in database systems. Inf Syst 20:1–20
Ling Y, Sun W (1995) A comprehensive evaluation of sampling-based size estimation. In: Proceedings of the IEEE 11th ICDE conference, pp 532–539
Ling Y, Sun W (1999) A hybrid estimator for selectivity estimation. IEEE TKDE 11:338–354
Merrett TH, Otoo E (1979) Distribution models of relations. In: Proceedings of the 5th VLDB conference, pp 418–425
Mullin JK (1993) Estimating the size of a relational join. Inf Syst 18:189–196
Sun W, Ling Y, Rishe N, Deng Y (1993) An instant and accurate size estimation method for joins and selection in retrieval-intensive environment. In: Proceedings of the ACM SIGMOD conference, pp 79–88
Vitter JS, Wang M (1999) Approximate computation of multidimensional aggregates of sparse data using wavelets. In: Proceedings of the ACM SIGMOD conference, pp 193–204
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this paper
Cite this paper
dell’ Aquila, C., Lefons, E., Tangorra, F. (2009). Estimating Join and Projection Selectivity Factors. In: Mastorakis, N., Mladenov, V., Kontargyri, V. (eds) Proceedings of the European Computing Conference. Lecture Notes in Electrical Engineering, vol 28. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-85437-3_27
Download citation
DOI: https://doi.org/10.1007/978-0-387-85437-3_27
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-84818-1
Online ISBN: 978-0-387-85437-3
eBook Packages: EngineeringEngineering (R0)