Abstract
This paper proposes a novel scheme, called ER-TCP, which transparently masks server-node failures in a cluster from clients at the TCP connection level. Connections at the server side are actively and fully replicated so that they remain consistent. A log mechanism is designed to cooperate with the replication so that the communication performance penalty stays small and the scheme scales beyond a few nodes, even when they have different processing capacities. The scheme is validated by experiments conducted on a prototype implementation.
This paper is supported by the National 863 Hi-Tech R&D Project under grant No. 2002AA1Z2102.
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Shao, Z., Jin, H., Cheng, B., Jiang, W. (2005). ER-TCP: An Efficient Fault-Tolerance Scheme for TCP Connections. In: Pan, Y., Chen, D., Guo, M., Cao, J., Dongarra, J. (eds) Parallel and Distributed Processing and Applications. ISPA 2005. Lecture Notes in Computer Science, vol 3758. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11576235_19
DOI: https://doi.org/10.1007/11576235_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29769-7
Online ISBN: 978-3-540-32100-2
eBook Packages: Computer Science (R0)