Advertisement

Distributed Systems: Maximizing Resilience

  • Igor SchagaevEmail author
Chapter

Abstract

We claim that system redundancy (natural or artificial, deliberately introduced) should be applied for the purposes of Performance, Reliability and Energy efficiency or “PRE-smartness”. A process of an algorithm of application of available redundancy for PRE-smartness is proposed showing that it is further extension generalized algorithm of fault tolerance, when property of fault detection is replaced as property of PRE-smartness. We present a system level implementation steps of PRE-smartness. Using ITACS LTD forward and backward tracing algorithms applied for distributed computer system we demonstrate that efficiency (reliability as a whole, especially availability; performance of detection and recovery) grow up to the level when real-time applications of distributed system grows substantially. Ans estimation of reliability gain is modeled and demonstrated suggesting a policy of implementation of PRE-smartness as a permanent on-going process required for the system successful functioning.

References

  1. 1.
    Blaeser L, Monkman S, Schagaev I Vision on reconfigurable systems, chapter 10Google Scholar
  2. 2.
    Schagaev I, Monkman S (2013) Redundancy + Reconfigurability = Recoverability. Electronics 2:212–233.  https://doi.org/10.3390/electronics2030212CrossRefGoogle Scholar
  3. 3.
    Castano V, Schagaev I Resilient computer system design.  https://doi.org/10.1007/978-3-319-15069-7CrossRefGoogle Scholar
  4. 4.
    Schagaev I, Kaegi-Trachsel T Software design for resilient computer systems.  https://doi.org/10.1007/978-3-319-29465-0CrossRefGoogle Scholar
  5. 5.
    Schagaev I, Kirk B Active system control.  https://doi.org/10.1007/978-3-319-29465-0CrossRefGoogle Scholar
  6. 6.
  7. 7.
  8. 8.
    Leon-Garsia A et al Communication networks. McGraw Hill, ISBN-0-07-246352Google Scholar
  9. 9.
  10. 10.
    Flynn MJ et al (1996) Parallel architectures. ACM Comput Surv 28(1)CrossRefGoogle Scholar
  11. 11.
    Schagaev I (1990) Yet another approach to classification of redundancy. In: CIM IMEKO symposium 1990, Helsinki, pp 117–124Google Scholar
  12. 12.
    Schagaev I, Sogomonoyan E (1988) Hardware and software for a fault tolerant computing system. Automat Remote Control 49(2), Part 1, Pergamon PressGoogle Scholar
  13. 13.
  14. 14.

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.IT-ACS LtdStevenageUK

Personalised recommendations