Skip to main content

Practical Aspects of Preventive Maintenance and Software Rejuvenation

  • Chapter
  • First Online:
Stochastic Models for Fault Tolerance
  • 530 Accesses

Abstract

In the design and development process of complex systems stochastic modelling and simulation are part of an iterative procedure. Often, before implementation the design of a new product is evaluated by means of formal and stochastic modelling. Later in the product development process prototypes are modelled, evaluated and improved. Therefore, stochastic modelling must follow or accompany the system development process. As stochastic modelling operates on a different level of abstraction than system development, the latter must not be observed in all technical detail. Technical system development issues for preventive maintenance and software rejuvenation in particular are at least as diverse as are the presented stochastic models. An exhaustive study of the history of the system development process is therefore far beyond the scope of this work. We will only briefly introduce main systems aspects of preventive maintenance as to ease later discussion of the related stochastic models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Reference

  1. Y.-M. Wang, Y. Huang, K.-P. Vo, P.-Y. Chung, C.M.R. Kintala, Checkpointing and Its applications. In FTCS-25: Proceedings of the 25th International Symposium on Fault-Tolerant Computing, Pasadena, CA, June 1995 (IEEE Computer Society, Los Alamitos, CA, 1995), pp. 22–31

    Google Scholar 

  2. B.C. Ling, E. Kiciman, A. Fox, Session State: Beyond Soft State. In NSDI’04: Proceedings of the 1st Symposium on Network Systems Design and Implementation (NSDI), San Francisco, CA, March 2004 (USENIX Association, Berkely, CA, 2004), pp. 295–308

    Google Scholar 

  3. I. Gertsbakh, Reliability Theory, with Applications to Preventive Maintenance (Springer, Berlin, 2000)

    MATH  Google Scholar 

  4. S. Garg, A. van Moorsel, K. Vaidyanathan, K.S. Trivedi, A Methodology for Detection and Estimation of Software Aging. In ISSRE’98: Proceedings of the International Symposium on Software Reliability Engineering, Paderborn, Germany, November 1998 (IEEE Computer Society, Los Alamitos, CA, 1998), pp. 283–292

    Google Scholar 

  5. V. Sundaram, S.H. Chaudhuri, S. Garg, C. Kintala, S. Bagchi, Improving Dependability Using Shared Supplementary Memory and Opportunistic Micro Rejuvenation in Multi-Tasking Embedded Systems. In PRDC’07: Proceedings of the 13th Pacific Rim International Symposium on Dependable Computing, Melbourne, VIC, Australia (IEEE Computer Society, Washington, DC, 2007), pp. 240–247

    Google Scholar 

  6. V. Paxson, M. Allmann, Computing TCP’s retransmission timer (RFC 2988, IETF, Nov 2000), http://www.rfc-editor.org/rfc/rfc2988.txt

  7. G. Candea, J. Cutler, A. Fox, R. Doshi, P. Garg, R. Gowda, Reducing Recovery Time in a Small Recursively Restartable System. In DSV’02: Proceedings of the International Conference on Dependable Systems and Networks, Bethesda, MD, June 2002 (IEEE Computer Society, Washington, DC, 2002)

    Google Scholar 

  8. A. Fox, D. Patterson, When Does Fast Recovery Trump High Reliability. In EASY’02: Proceedings of the 2nd Workshop on Evaluating and Architecting System Dependability, San Jose, CA, October 2002

    Google Scholar 

  9. M. Hamill, K. Goseva-Popstojanova, Common trends in software fault and failure data. IEEE Trans. Software Eng. 35(4), 484–496 (2009)

    Article  Google Scholar 

  10. J.J. McCall, Maintenance policies for stochastically failing equipment: A survey. Manage. Sci. 11, 493–521 (1965)

    MathSciNet  MATH  Google Scholar 

  11. R. Matias Jr., P.J.F. Filho, An Experimental Study on Software Aging and Rejuvenation in Web Servers. In COMPSAC’06: Proceedings of the 30th Annual International Computer Software and Applications Conference, Chicago, IL (IEEE Computer Society, Washington, DC, 2006), pp. 189–196

    Google Scholar 

  12. Y. Huang, C. Kintala, N. Kolettis, N.D. Fulton, Software Rejuvenation: Analysis, Module and Applications. In Proceedings of the 25th Symposium on Fault Tolerant Computing, Pasadena, CA, June 1995 (IEEE Computer Society, Los Alamitos, CA, 1995), pp. 381–390

    Google Scholar 

  13. G. Candea, S. Kawamoto, Y. Fujiki, G. Friedman, A. Fox, Microreboot: A Technique for Cheap Recovery. In OSDI’04: Proceedings of the USENIX/ACM Symposium on Operating Systems Design & Implementation, San Francisco, CA, December 2004 (USENIX Association, Berkely, CA)

    Google Scholar 

  14. G. Candea, A.B. Brown, A. Fox, D.A. Patterson, Recovery-oriented computing: Building multitier dependability. IEEE Comput. 37(11), 60–67 (2004)

    Article  Google Scholar 

  15. C. Valedez-Flores, R.M. Feldman, A survey of preventive maintenance models for stochastically deteriorating single-unit systems. Naval Res. Logist. 36, 419–446 (1989)

    Article  MathSciNet  Google Scholar 

  16. M. Poniatowski, UNIX User’s Handbook (Prentice-Hall, Upper Saddle River, NJ, 2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Katinka Wolter .

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Wolter, K. (2010). Practical Aspects of Preventive Maintenance and Software Rejuvenation. In: Stochastic Models for Fault Tolerance. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11257-7_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-11257-7_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-11256-0

  • Online ISBN: 978-3-642-11257-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics