Abstract
In a distributed environment relations are stored at different sites. To perform algebraic operations such as join, the relations are to be transferred from one site to the other in such a way that the total communication cost is minimized. This paper deals with the problem of computing the transmission cost using two approaches. The first uses System R* algorithm approach when the data is of non-skew nature and the second uses SharesSkew algorithm when the data has skews i.e., same value for a specific join attribute, named as Heavy Hitter (HH). Rules of the two algorithms to be followed for performing join are specified and by illustrating with Banking System, the communication cost is evaluated.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ullman DJ (1984) Principles of database systems, 2nd edn. Galgotia Publications
Afrati NF, Stasinopoulos N, Ullman DJ (2018) SharesSkew: an algorithm to handle skew for joins in MapReduce. Inf Syt 7:129–150 (Elsevier)
Afrati NF, Ullman DJ (2011) Optimizing multiway joins in map-reduce environment. In: IEEE transactions on knowledge and data engineering, vol 23(9), pp 1282–1298
Beame P, Koutris P, Suciu D (2014) Skew in parallel query processing. In: Proceedings of the 33rd ACM SIGMOD symposium on principle of database systems, USA, pp 212–223
Chu S, Balazinska M, Suciu D (2015) From theory to practice: efficient join query evaluation in a parallel database system. In: Proceedings of the 2015 ACM SIDMOD international conference on management of data, ACM
Kwon Y, Balazinska M, Howe B, Rolia J (2012) SkewTune: mitigating skew in mapreduce applications. In: Proceedings of the 2012 ACM SIDMOD international conference on management of data, pp 25–36, ACM, USA
Kwon Y, Balazinska M, Howe B, Rolia J (2012) SkewTune in action: mitigating skew in mapreduce applications. PVLDB 5(12):1934–1937
Ullman DJ (2012) Designing good mapreduce algorithms. XRDS 19(1):30–34
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Krishna, C.L., Reddy, P.V.S. (2020). Optimization of Multi-way Join Cost Using System R* and SharesSkew. In: Kumar, A., Paprzycki, M., Gunjan, V. (eds) ICDSMLA 2019. Lecture Notes in Electrical Engineering, vol 601. Springer, Singapore. https://doi.org/10.1007/978-981-15-1420-3_5
Download citation
DOI: https://doi.org/10.1007/978-981-15-1420-3_5
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-1419-7
Online ISBN: 978-981-15-1420-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)