Skip to main content

On-Line Adaptive Algorithms in Autonomic Restart Control

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 6407))

Abstract

Restarts or retries are typical control schemes to meet a deadline in real-time systems, and are regarded as significant environmental diversity techniques in dependable computing. This paper reconsiders a restart control studied by van Moorsel and Wolter (2006), and refines their result from theoretical and statistical points of views. Based on the optimality principle, we show that the time-fixed restart time is best even in non-stationary control setting under the assumption of unbounded restart opportunities. Next we study statistical inference for the restart time interval and develop on-line adaptive algorithms for estimating the optimal restart time interval via non-parametric estimation and reinforcement learning. Finally, these algorithms are compared in a simulation study.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Avritzer, A., Bondi, A., Grottke, M., Weyuker, E.J., Trivedi, K.S.: Performance assurance via software rejuvenation: monitoring, statistics and algorithms. In: Proceedings of 36th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2006), pp. 435–444. IEEE CS Press, Los Alamitos (2006)

    Google Scholar 

  2. Bao, Y., Sun, X., Trivedi, K.S.: Adaptive software rejuvenation: degradation model and rejuvenation scheme. In: Proceedings of 33rd Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN-2003), pp. 241–248. IEEE CS Press, Los Alamitos (2003)

    Google Scholar 

  3. Bao, Y., Sun, X., Trivedi, K.S.: A workload-based analysis of software aging, and rejuvenation. IEEE Transactions on Reliability 54(3), 541–548 (2005)

    Article  Google Scholar 

  4. Dohi, T., Gosĕva-Popstojanova, K., Trivedi, K.S.: Estimating software rejuvenation schedule in high assurance systems. Computer Journal 44(6), 473–485 (2001)

    Google Scholar 

  5. Eto, H., Dohi, T.: Analysis of a service degradation model with preventive rejuvenation. In: Penkler, D., Reitenspiess, M., Tam, F. (eds.) ISAS 2006. LNCS, vol. 4328, pp. 17–29. Springer, Heidelberg (2006)

    Google Scholar 

  6. Eto, H., Dohi, T., Ma, J.: Simulation-based optimization approach for software cost model with rejuvenation. In: Rong, C., Jaatun, M.G., Sandnes, F.E., Yang, L.T., Ma, J. (eds.) ATC 2008. LNCS, vol. 5060, pp. 206–218. Springer, Heidelberg (2008)

    Google Scholar 

  7. Garg, S., Telek, M., Puliafito, A., Trivedi, K.S.: Analysis of software rejuvenation using Markov regenerative stochastic Petri net. In: Proceedings of 6th International Symposium on Software Reliability Engineering (ISSRE 1995), pp. 24–27. IEEE CS Press, Los Alamitos (1995)

    Google Scholar 

  8. Garg, S., Pfening, S., Puliafito, A., Telek, M., Trivedi, K.S.: Analysis of preventive maintenance in transactions based software systems. IEEE Transactions on Computers 47, 96–107 (1998)

    Google Scholar 

  9. Gosavi, A.: Simulation-Based Optimization: Parametric Optimization Techniques and Reinforcement Learning. Kluwer Academic Publishers, Dordrecht (2003)

    Google Scholar 

  10. Huang, Y., Kintala, C., Kolettis, N., Fulton, N.D.: Software rejuvenation: analysis, module and applications. In: Proceedings of 25th International Symposium on Fault Tolerant Computing (FTC 1995), pp. 381–390. IEEE CS Press, Los Alamitos (1995)

    Google Scholar 

  11. Maurer, S.M., Huberman, B.A.: Restart strategies and Internet congestion. Journal of Economic Dynamics and Control 25, 641–654 (2001)

    Article  MATH  Google Scholar 

  12. Okamura, H., Nishimura, Y., Dohi, T.: A dynamic checkpointing scheme based on reinforcement learning. In: Proceedings of The 10th International Symposium on Pacific Rim Dependable Computing (PRDC 2004), pp. 151–158. IEEE CS Press, Los Alamitos (2004)

    Google Scholar 

  13. Puterman, M.: Markov Decision Processes. John Wiley & Sons, New York (1994)

    Google Scholar 

  14. Reinecke, P., van Moorsel, A., Wolter, K.: A measurement study of the interplay between application level restart and transport protocol. In: Malek, M., Reitenspiess, M., Kaiser, J. (eds.) ISAS 2004. LNCS, vol. 3335, pp. 86–100. Springer, Heidelberg (2004)

    Google Scholar 

  15. Reinecke, P., van Moorsel, A., Wolter, K.: The fast and the fair: a fault-injection-driven comparison of restart oracles for reliable web. In: Proceedings of 3rd International Conference on the Quantitative Evaluation of Systems (QEST 2006), pp. 375–384. IEEE CS Press, Los Alamitos (2006)

    Google Scholar 

  16. Rinsaka, K., Dohi, T.: A faster estimation algorithm for periodic preventive rejuvenation schedule maximizing system availability. In: Malek, M., Reitenspieß, M., van Moorsel, A. (eds.) ISAS 2007. LNCS, vol. 4526, pp. 94–104. Springer, Heidelberg (2007a)

    Google Scholar 

  17. Rinsaka, K., Dohi, T.: Non-parametric predictive inference of preventive rejuvenation schedule in operational software systems. In: Proceedings of The 18th International Symposium on Software Reliability Engineering (ISSRE 2007), pp. 247–256. IEEE CS Press, Los Alamitos (2007b)

    Google Scholar 

  18. Sutton, R.S., Barto, A.: Reinforcement Learning. MIT Press, Cambridge (1998)

    Google Scholar 

  19. Suzuki, H., Dohi, T., Goševa-Popstojanova, K., Trivedi, K.S.: Analysis of multi step failure models with periodic software rejuvenation. In: Artalejo, J.R., Krishnamoorthy, A. (eds.) Advances in Stochastic Modelling, pp. 85–108 (2002), Notable Publications

    Google Scholar 

  20. Tsai, W.-Y., Jewell, N.P., Wang, M.-C.: A note on the product-limit estimator under right censoring and left truncation. Biometrika 74(4), 883–886 (1987)

    Google Scholar 

  21. Vaidyanathan, K.V., Trivedi, K.S.: A comprehensive model for software rejuvenation. IEEE Transactions on Dependable and Secure Computing 2(2), 124–137 (2005)

    Google Scholar 

  22. van Moorsel, A., Wolter, K.: Analysis and algorithms for restart. In: Proceedings of 1st International Conference on the Quantitative Evaluation of Systems (QEST 2004), pp. 195–204. IEEE CS Press, Los Alamitos (2004)

    Google Scholar 

  23. van Moorsel, A., Wolter, K.: Optimal restart times for moments of completion time. IEE Proceedings of Software Engineering 151(5), 219–223 (2004)

    Google Scholar 

  24. van Moorsel, A., Wolter, K.: Analysis of restart mechanisms in software systems. IEEE Transactions on Software Engineering 32(8), 547–558 (2006)

    Google Scholar 

  25. Wang, D., Xie, W., Trivedi, K.S.: Performability analysis of clustered systems with rejuvenation under varying workload. Performance Evaluation 64(3), 247–265 (2007)

    Google Scholar 

  26. Yurcik, W., Doss, D.: Achieving fault-tolerant software with rejuvenation and reconfiguration. IEEE Software 18(4), 48–52 (2001)

    Google Scholar 

  27. Zhu, Q., Yuan, C.: A reinforcement learning approach to automatic error recovery. In: Proceedings of 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2007), pp. 729–738. IEEE CS Press, Los Alamitos (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Okamura, H., Dohi, T., Trivedi, K.S. (2010). On-Line Adaptive Algorithms in Autonomic Restart Control. In: Xie, B., Branke, J., Sadjadi, S.M., Zhang, D., Zhou, X. (eds) Autonomic and Trusted Computing. ATC 2010. Lecture Notes in Computer Science, vol 6407. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16576-4_3

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-16576-4_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-16575-7

  • Online ISBN: 978-3-642-16576-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics