Real-Time Fault-Tolerant Communication in Distributed Computing Systems
Distributed systems with point-to-point interconnection networks are natural candidates for real-time fault-tolerant communication because parallel processing and communication as well as fault-tolerance can be achieved using multiple processors and interconnection paths between every pair of nodes. However, due to the contention among randomly-arriving messages at each node/link and multi-hops between the source and destination that a message must travel, it is difficult to guarantee the timely delivery of the messages. The goal of this chapter is to remove this difficulty and simultaneously realize the potential of distributed systems for high performance and high reliability.
KeywordsSource Node Destination Node Faulty Node Basic Circuit Primary Channel
Unable to display preview. Download preview PDF.
- Q. Zheng and K. G. Shin, “On the ability of establishing real-time channels in point-to-point packet-switched networks,” IEEE Transactions on Communication (in press), 1993.Google Scholar
- Q. Zheng, Real-time Fault-tolerant Communication in Computer Networks, PhD thesis, University of Michigan, 1993. PostScript version of the thesis is available via anonymous FTP from ftp.eecs.umich.edu in directory outgoing/zheng.Google Scholar
- D. D. Kandlur, K. G. Shin, and D. Ferrari, “Real-time communication in multi-hop networks,” in Proc. Int. Conf. on Distributed Computer Systems, pp. 300–307. IEEE, May 1991.Google Scholar
- Q. Zheng and K. G. Shin, “Real-time communication in local area ring networks,” in Conference on Local Computer Networks, pp. 416–425, September 1992.Google Scholar
- A. Indiresan and Q. Zheng, “Design and evaluation of a fast deadline scheduling switch for multicomputers,” RTCL working document, December 1991.Google Scholar
- Q. Zheng, K. G. Shin, and E. Abram-Profeta, “Transmission of compressed digital motion video over computer networks,” in Digest of COMPCON Spring, pp. 37–46, February 1993.Google Scholar
- K. G. Shin, “HARTS: A distributed real-time architecture,” IEEE Computer, vol. 24, no. 5, pp. 25–36, May 1991.Google Scholar
- D. D. Kandlur and K. G. Shin, “Design of a communication subsystem for HARTS,” Technical Report CSE-TR-109-91, CSE Division, Department of EECS, The University of Michigan, October 1991.Google Scholar