Skip to main content

Estimating Join and Projection Selectivity Factors

  • Conference paper
  • First Online:
Book cover Proceedings of the European Computing Conference

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 28))

  • 602 Accesses

Abstract

The selectivity factor of relational operations is a critical parameter for determining the cost function of query processing. Good estimates of these parameters allow the optimizers to choose the least expensive path in the query execution. A method for estimating the join and projection selectivity factors based on the orthogonal polynomial series is presented. Experimental results on real data are also reported which show the good performance of the approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bell DA, Ling DHO, McClean S (1989) Pragmatic estimation of join sizes and attribute correlations. In: Proceedings of the IEEE 5th ICDE conference, Computer Society Press, Los Alamitos, pp 76–84

    Google Scholar 

  2. Chaudhuri S, Narasayya VR (2001) Automating statistics management for query optimizers. IEEE TKDE 13:7–20

    Google Scholar 

  3. Chaudhuri S, Das G, Datar M, Motwani R, Narasayya V (2001) Overcoming limitations of sampling for aggregation queries. In: Proceedings of the IEEE ICDE conference, Computer Society Press, Los Alamitos, pp 534–542

    Google Scholar 

  4. Chen CM, Roussopoulos N (1994) Adaptive selectivity estimation using query feedback. In: Proceedings of the ACM SIGMOD conference, pp 161–172

    Google Scholar 

  5. Ciaccia P, Maio D (1995) Domains and active domains: what this distinction implies for the estimation of projection sizes in relational databases. IEEE TKDE 4:641–654

    Google Scholar 

  6. Gardy D, Puech C (1984) On the sizes of projections a generating functions approach. Inf Syst 9:231–235

    Article  Google Scholar 

  7. Gibbons PB (2001) Distinct sampling for highly-accurate answers to distinct values queries and event reports. In: Proceedings of the 27th VLDB conference, Roma, Italy, pp 541–550

    Google Scholar 

  8. Haas PJ, Naughton JF, Seshadri S, Stokes L (1995) Sampling-based estimation of the number of distinct values of an attribute. In: Proceedings of the 21th VLDB conference, Morgan Kaufmann Publishers, San Francisco, CA, pp 311–322

    Google Scholar 

  9. Ioannidis YE, Christodoulakis S (1993) Optimal histograms for limiting worst-case error propagation in the size of the join results. ACM TODS 18:709–748

    Article  Google Scholar 

  10. Ioannidis YE, Poosala V (1995) Balancing histogram optimality and practicality for query result size estimation. In: Proceedings of the ACM SIGMOD conference, pp 233–244

    Google Scholar 

  11. Jiang Z, Luo C, Hou W-C, Yan F, Zhu Q (2006) Estimating aggregate join queries over data streams using discrete cosine transform. In: Proceedings of the DEXA conference, pp 182–192

    Google Scholar 

  12. Lefons E, Merico A, Tangorra F (1995) Analytical profile estimation in database systems. Inf Syst 20:1–20

    Article  Google Scholar 

  13. Ling Y, Sun W (1995) A comprehensive evaluation of sampling-based size estimation. In: Proceedings of the IEEE 11th ICDE conference, pp 532–539

    Google Scholar 

  14. Ling Y, Sun W (1999) A hybrid estimator for selectivity estimation. IEEE TKDE 11:338–354

    Google Scholar 

  15. Merrett TH, Otoo E (1979) Distribution models of relations. In: Proceedings of the 5th VLDB conference, pp 418–425

    Google Scholar 

  16. Mullin JK (1993) Estimating the size of a relational join. Inf Syst 18:189–196

    Article  Google Scholar 

  17. Sun W, Ling Y, Rishe N, Deng Y (1993) An instant and accurate size estimation method for joins and selection in retrieval-intensive environment. In: Proceedings of the ACM SIGMOD conference, pp 79–88

    Google Scholar 

  18. Vitter JS, Wang M (1999) Approximate computation of multidimensional aggregates of sparse data using wavelets. In: Proceedings of the ACM SIGMOD conference, pp 193–204

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Carlo dell’ Aquila .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer Science+Business Media, LLC

About this paper

Cite this paper

dell’ Aquila, C., Lefons, E., Tangorra, F. (2009). Estimating Join and Projection Selectivity Factors. In: Mastorakis, N., Mladenov, V., Kontargyri, V. (eds) Proceedings of the European Computing Conference. Lecture Notes in Electrical Engineering, vol 28. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-85437-3_27

Download citation

  • DOI: https://doi.org/10.1007/978-0-387-85437-3_27

  • Published:

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-84818-1

  • Online ISBN: 978-0-387-85437-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics