Skip to main content

Operating systems and fault-tolerance

  • Conference paper
  • First Online:
Operating Systems of the 90s and Beyond

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 563))

  • 142 Accesses

Abstract

The key to constructing fault-tolerant software is managing the extra complexity through the use of appropriate programming paradigms, abstractions, and system structuring techniques. While significant progress has been made in these areas, it is clear that additional work is required to bring the level of software sophistication in this area up to that found elsewhere. This is a problem not just for the new hardware architectures that are now coming into play, but also for current hardware architectures. However, no matter what advances are made, the inherent complications induced by the need to worry about failures will always make the design, implementation, and verification of fault-tolerant systems more complex than other systems.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Laprie, J.C. (ed.), Dependability: Basic Concepts and Associated Terminology, Vol. 5, Dependable Computing and Fault-Tolerant Systems Series, Springer-Verlag, Vienna, to appear.

    Google Scholar 

  2. Neumann, P.G., Illustrative risks to the public in the use of computer systems and related technology, Software Engineering Notes 16, 1 (Jan. 1991), 2–9.

    Google Scholar 

  3. Schneider, F.B., Implementing fault-tolerant services using the state machine approach: A tutorial. ACM Computing Surveys 22, 4 (Dec. 1990), 299–319.

    Article  Google Scholar 

  4. Liskov, B., and Scheifler, R. Guardians and Actions: Linguistic Support for Robust Distributed Programs, ACM Trans. on Programming Languages and Systems 5, 3 (July 1983), 381–404.

    Article  Google Scholar 

  5. Randell, B., Lee, P.A., and Treleaven, P.C., Reliability Issues in Computing System Design, ACM Computing Surveys 10, 2 (June 1978), 123–166.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Arthur Karshmer Jürgen Nehmer

Rights and permissions

Reprints and permissions

Copyright information

© 1991 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Schlichting, R.D. (1991). Operating systems and fault-tolerance. In: Karshmer, A., Nehmer, J. (eds) Operating Systems of the 90s and Beyond. Lecture Notes in Computer Science, vol 563. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0024535

Download citation

  • DOI: https://doi.org/10.1007/BFb0024535

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-54987-1

  • Online ISBN: 978-3-540-46630-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics