Abstract
One of the important issues in range partitioning schemes is data skew. Tuples distribution across nodes may be skewed (some nodes have many tuples, while others may have fewer tuples). Processing skewed data not only slows down the response time, but also generates hot nodes. In such a situation, data may need to be moved from the most-loaded partitions to the least-loaded ones in order to achieve storage balancing requirements. Early works from the State-of-The-Art focused on achieving load balancing. However, today’s works focus on reducing the load balancing cost. This latter involves reducing the cost of maintaining partition statistics. In this context, we propose to improve one of the best load balancing work, that is the one of Ganesan et al., to reduce the cost of maintaining the statistics of load balancing. We introduce the concept of fuzzy system image. Both nodes and clients have approximate information about the load distribution. They can nevertheless locate any data with almost the same efficiency as using exact partition statistics. Furthermore, maintaining load distribution statistics do not require exchanging additional messages as opposed to the cost of efficient solutions from the State-of-The-Art (which requires at least \(\mathbb {O}(\log {n})\) messages).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bensberg, C., Becker, J., Mueller, C., Thumfart, A.: Dynamic range partitioning, US Patent App. 14/463,060, 19 August 2014
Ganesan, P., Bawa, M., Garcia-Molina, H.: Online balancing of range-partitioned data with applications to peer-to-peer systems. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol. 30, pp. 444–455. VLDB Endowment (2004)
Aspnes, J., Shah, G.: Skip graphs. ACM Trans. Algorithms (TALG) 3(4), 37 (2007)
Pugh, W.: Skip lists: a probabilistic alternative to balanced trees. Commun. ACM 33(6), 668–676 (1990)
Felber, P., Kropf, P., Schiller, E., Serbu, S.: Survey on load balancing in peer-to-peer distributed hash tables. IEEE Commun. Surv. Tutorials 16(1), 473–492 (2014)
Mirrezaei, S.I., Shahparian, J.: Data load balancing in heterogeneous dynamic networks. arXiv preprint arXiv:1602.04536 (2016)
Chawachat, J., Fakcharoenphol, J.: A simpler load-balancing algorithm for range-partitioned data in peer-to-peer systems. Networks 66(3), 235–249 (2015)
Antoine, M., Pellegrino, L., Huet, F., Baude, F.: A generic API for load balancing in structured P2P systems. In: 2014 International Symposium on Computer Architecture and High Performance Computing Workshop (SBAC-PADW), pp. 138–143. IEEE (2014)
Takeda, A., Oide, T., Takahashi, A., Suganuma, T.: Efficient dynamic load balancing for structured P2P network. In: 2015 18th International Conference on Network-Based Information Systems (NBiS), pp. 432–437. IEEE (2015)
Mizutani, K., Inoue, T., Mano, T., Akashi, O., Matsuura, S., Fujikawa, K.: Stable load balancing with overlapping ID-space management in range-based structured overlay networks. Inf. Media Technol. 11, 1–10 (2016)
Harvey, N.J.A., Jones, M.B., Saroiu, S., Theimer, M., Wolman, A.: Skipnet: a scalable overlay network with practical locality properties. Networks 34(38) (2003)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Belayadi, D., Hidouci, KW., Midoun, K. (2019). CARP: Cost Effective Load-Balancing Approach for Range-Partitioned Data. In: Demigha, O., Djamaa, B., Amamra, A. (eds) Advances in Computing Systems and Applications. CSA 2018. Lecture Notes in Networks and Systems, vol 50. Springer, Cham. https://doi.org/10.1007/978-3-319-98352-3_30
Download citation
DOI: https://doi.org/10.1007/978-3-319-98352-3_30
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98351-6
Online ISBN: 978-3-319-98352-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)