Skip to main content
Log in

Model and algorithm of backward error recovery of distributed software

  • Regular Papers
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

Backward error recovery is one of the important techniques of software fault tolerance. Because of error propagation its recovery in distributed software needs cooperation between processes to achieve consistent recovery. However, the techniques of the achievement suffer from either concurrency level decreasing or the domino effect. Based on a formal model of the distributed system, a backward recovery protocol without the two drawbacks is specified in this paper. The algorithm of the protocol is proven strictly and its implementation is proposed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Similar content being viewed by others

References

  1. B. Randellet al., Reliability issues in computing system design,Computing Surveys,10:2 (1978).

    Google Scholar 

  2. B. Randell, System structure for software fault tolerance,IEEE Trans. SE-1:2 (1975).

    Google Scholar 

  3. P. Jalote and R. H. Campbell, Fault Tolerance Using Communicating Sequential Processes, FTCS- 14, 1984.

  4. P. Jalote and R. H. Compbell, Atomic actions for fault tolerances using CSP,IEEE Trans. SE-12:1 (1986).

    Google Scholar 

  5. S. T. Grigory, and J. C. Knight, A New Linguistic Approach To Backward Error Recovey, FTCS- 15, 1985.

  6. P. M. Merlin and B. Randell, Consistent State Restoration In Distributed Systems, FTCS- 8, 1978.

  7. W. G. Wood, Recovery Control of Communicating Processes in a Distributed System, FTCS- 11, 1981.

  8. K. Zielinsky, Model of error propagation in systems of communicating processes,Science of Computing Programming,6 (1986), 191–205.

    Article  Google Scholar 

  9. D. L. Russell, State restoration in systems of communicating processes,IEEE Trans. SE-6:2 (1980).

    Google Scholar 

  10. K. H. Kim, An Implementation of a Programmer Transparent Scheme for Coordination Concurrent Processes in Recovery, COMPSAC 1980.

  11. A. Ciuffoletti, Error Recovery in Systems of Communication Process, 7th Intern, Conf. on Softw. Eng., 1984.

  12. D. Briaticoet al., A Distributed Domino- Effect Free Recovery Algorithm, IEEE 1984 Conf. on Reliability in Distributed Software and Database Systems.

  13. B. Randell, Fault Tolerance and System Structuring, from Reliable Computer Systems, Springer- Verlag, 1985.

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Liu, C., Wen, C. Model and algorithm of backward error recovery of distributed software. J. of Comput. Sci. & Technol. 4, 275–285 (1989). https://doi.org/10.1007/BF02943542

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02943542

Keywords

Navigation