Abstract
ZooKeeper atomic broadcast (Zab) is at the core of ZooKeeper system, enforcing a total order on service requests that seek to modify the replicated service state. Since it is a leader based protocol, its performance degrades when the leader server is made to handle an increased message traffic. We address this concern by having the other, non-leader server replicas toss a coin and broadcast their acknowledgement of a leader’s proposal only if the toss results in an outcome of Head. We model the coin-tossing process and derive analytical expressions for estimating the coin’s probability of Head for a given arrival rate of service requests such that the dual objectives of performance gains and traffic reduction can be accomplished. Experiments compare the performance of our coin-tossing Zab version (ZabCT) with Zab performance and confirm that the dual objectives are demonstrably met under heavy workloads. Moreover, ZabCT meets all requirements essential for crash-tolerance provisions within Zab which can therefore be adopted in any ZabCT implementation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ban, B.: Jgroups, a toolkit for reliable multicast communication (2002). http://www.jgroups.org
Biely, M., Milosevic, Z., Santos, N., Schiper, A.: S-paxos: offloading the leader for high throughput state machine replication. In: IEEE 31st Symposium on Reliable Distributed Systems (SRDS), pp. 111–120 (2012)
Chandra, T.D., Griesemer, R., Redstone, J.: Paxos made live: an engineering perspective. In: Proceedings of the Twenty-Sixth Annual ACM Symposium on Principles of Distributed Computing, pp. 398–407 (2007)
Défago, X., Schiper, A., Urbán, P.: Total order broadcast and multicast algorithms: taxonomy and survey. ACM Comput. Surv. (CSUR) 36(4), 372–421 (2004)
Emerson, R., Ezhilchelvan, P.: An atomic-multicast service for scalable in-memory transaction systems. In: 2014 IEEE 6th International Conference on Cloud Computing Technology and Science (CloudCom), pp. 743–746. IEEE (2014)
George, L.: HBase: The Definitive Guide. O’Reilly Media, Inc., Sebastopol (2011)
Hunt, P., Konar, M., Junqueira, F.P., Reed, B.: Zookeeper: wait-free coordination for internet-scale systems. In: USENIX Annual Technical Conference, vol. 8, p. 9 (2010)
Junqueira, F.P., Reed, B.C., Serafini, M.: Zab: high-performance broadcast for primary-backup systems. In: IEEE/IFIP 41th International Conference on Dependable Systems & Networks (DSN), pp. 245–256. IEEE (2011)
Lamport, L.: Time, clocks, and the ordering of events in a distributed system. Commun. ACM 21(7), 558–565 (1978)
Lamport, L.: Paxos made simple. ACM Sigact News 32(4), 18–25 (2001)
Lamport, L.: Fast paxos. Distrib. Comput. 19(2), 79–103 (2006)
Mao, Y., Junqueira, F.P., Marzullo, K.: Mencius: building efficient replicated state machines for WANs. OSDI 8, 369–384 (2008)
Reed, B., Junqueira, F.P.: A simple totally ordered broadcast protocol. In: proceedings of the 2nd Workshop on Large-Scale Distributed Systems and Middleware (LADIS), pp. 1–6. ACM (2008)
Ruivo, P., Couceiro, M., Romano, P., Rodrigues, L.: Exploiting total order multicast in weakly consistent transactional caches. In: IEEE 17th Pacific Rim International Symposium on Dependable Computing (PRDC), pp. 99–108 (2011)
Shvachko, K., Kuang, H., Radia, S., Chansler, R.: The hadoop distributed file system. In: IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), vol. 2, pp. 1–10 (2010)
Toshniwal, A., Taneja, S., Shukla, A., Ramasamy, K., Patel, J.M., Kulkarni, S., Jackson, J., Gade, K., Fu, M., Donham, J.: Storm@twitter. In: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, pp. 147–156 (2014)
Van Renesse, R., Schneider, F.B.: Chain replication for supporting high throughput and availability. OSDI 4, 91–104 (2004)
Author information
Authors and Affiliations
Corresponding authors
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
EL-Sanosi, I., Ezhilchelvan, P. (2017). Improving ZooKeeper Atomic Broadcast Performance by Coin Tossing. In: Reinecke, P., Di Marco, A. (eds) Computer Performance Engineering. EPEW 2017. Lecture Notes in Computer Science(), vol 10497. Springer, Cham. https://doi.org/10.1007/978-3-319-66583-2_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-66583-2_16
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-66582-5
Online ISBN: 978-3-319-66583-2
eBook Packages: Computer ScienceComputer Science (R0)