This chapter explains hardware faults, their origins, dependency on technology used, and some known solutions of fault toleration using error correction codes and redundancy hardware schemes. Shown that with growing density of hardware there is a risk of multiple temporary fault which grows at order of magnitude prime concern for designers of new computer systems for safety-critical application. Hardware faults occur due to natural phenomena such as ionized radiation, variations in the manufacturing process, vibrations, etc. We present in this chapter a short introduction to hardware faults, show the typical fault types and patterns, and also give examples of how to deal with these faults.
- 2.Constantinescu C (2008) Intermittent faults and effects on reliability of integrated circuits. In: RAMS 2008. Annual, pp 370–374, Jan 2008Google Scholar
- 3.Wells P, Chakraborty K, Sohi GS (2007) Adapting to intermittent faults in future multicore systems. In: 16th international conference on parallel architecture and compilation techniques, PACT, p 431, Sept 2007Google Scholar
- 4.Hamming R (1950) Error detection and error correction codes. Bell Syst Tech J XXVI(9):147–160Google Scholar
- 7.Birolini A (2004) Reliability engineering theory and practice, 8th edn. SpringerGoogle Scholar
- 8.Schagaev I, Castano V (2015) Resilient computer system design. Springer. ISBN 978-3-319-15069-7Google Scholar