Abstract
Replication of data or processes is an effective way to provide enhanced performance, high availability and fault tolerance in distributed systems. For instance, in systems based on the client-server model, a server may serve many clients and because of heavy loads, the server cannot respond to the requests on time. In such a case, replicating data or servers may improve performance. Moreover, data and processes can be replicated to protect against failures. However, this is a very complex procedure. In this paper, I propose a method, to make systems fault tolerant based on replication, by way of exploiting the use of collaborative agents. This method is also used to improve fault tolerance in multi-agent systems.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
The ISIS Project, http://www.cs.cornell.edu/Info/Projects/Isis/
The Horus Project, http://www.cs.cornell.edu/Info/Projects/HORUS/
Cristian, F., et al.: Fault-Tolerance in the Advanced Automation System. In: 20th International Conference on Fault-Tolerant Computing, Newcastle upon Tyne, England (1990)
Mishra, S.: Consul: A Communication Substrate for Fault-Tolerant Distributed Programs. Ph.D thesis, Dept. of Computer Science, Univ. of Arizona, Tuscon, Arizona (1992)
Schneider, F.: Implementing Fault-Tolerant Services Using the State Machine Approach: A Tutorial. ACM Computing Surveys 22(4), 299–319 (1990)
Elmootazbellah, N., et al.: Replicated Distributed Processes. In: Proceedings of the Twenty-Second International Symposium on Fault Tolerant Computing (FTCS-22), pp. 18–27 (1992)
Shokri, E., et al.: An Approach for Adaptive Fault-Tolerance in Object-Oriented Open Distributed Systems. In: Workshop on Object-Oriented Reliable Distributed Systems (1997)
Ren, J., Cukier, M., Rubel, P., Sanders, W., Karr, D.: Building Dependable Distributed Applications using AQuA. In: Proceeding of the 4th IEEE International Symp. On High Assurance Systems Engineering, pp. 189–196 (1999)
Tanenbaum, A.S., van Steen, M.: Distributed Systems: Principles and Paradigms. Prentice-Hall, Englewood Cliffs (2002)
Chockler, G.V., Keidar, I., Vitenberg, R.: Group Communication Specifications: A Comprehensive Study. ACM Computing Surveys 33(4), 1–43 (2001)
Grasshopper Programmer’s Guide, http://www.grasshopper.de
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bora, S. (2006). A Fault Tolerant System Using Collaborative Agents. In: Savacı, F.A. (eds) Artificial Intelligence and Neural Networks. TAINN 2005. Lecture Notes in Computer Science(), vol 3949. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11803089_25
Download citation
DOI: https://doi.org/10.1007/11803089_25
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36713-0
Online ISBN: 978-3-540-36861-8
eBook Packages: Computer ScienceComputer Science (R0)