Skip to main content

Physical Fault Models and Fault Tolerance

  • Chapter
  • First Online:

Part of the book series: Frontiers in Electronic Testing ((FRET,volume 43))

Abstract

Dependable systems are obtained by means of extensive testing procedures and the incorporation of fault tolerance mechanisms encompassing error detection (on-line testing) and system recovery. In that context, the characterization of fault models that are both tractable and representative of actual faults constitute an essential basis upon which one can efficiently verify, design or assess dependable systems. On one hand, models should refer to erroneous behaviors that are as abstract and as broad as possible to allow for the definition and development of both generic fault tolerance mechanisms and cost-effective injection techniques. On the other hand, the models should definitely aim at matching the erroneous behaviors induced by real faults.

In this chapter, we focus on the representativeness of fault models with respect to physical faults for deriving relevant testing procedures as well as detection mechanisms and experimental assessment techniques. We first discuss the accuracy of logic fault models with respect to physical defects in the implementation of off-line/on-line testing mechanisms. Then, we show how the fault models are linked to the identification and implementation of relevant fault injection-based dependability assessment techniques.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    EFCIS: Société pour l’Etude et la Fabrication de Circuits Intégrés Spéciaux, that has evolved to form ST Microelectronics, in the late 1990s.

  2. 2.

    Recent work oriented towards the development of (fault injection-based) dependability benchmarks (e.g., see Kanoun and Spainhower 2008) has adapted the notions attached to the A and F domains to the ones of Workload and Faultload, respectively.

  3. 3.

    These efforts include also the IFIP WG 10.4 SIG on Dependability Benchmarking (http://www.dependability.org/wg10.4/SIGDeB) and the European Project on Dependability Benchmarking – DBench - Project IST 2000–25425 (http://www.laas.fr/dbench).

  4. 4.

    MARS (MAintainable Real-time System) is the distributed system developed at Vienna Univ. of Technology, that has evolved to the TTA and TTP concepts (Kopetz and Bauer 2003).

  5. 5.

    Fail silence is intended to describe the behavior of a computer that fails “cleanly” by just stopping to send messages in case a failure occurs (Powell, 1994).

  6. 6.

    Such a direct control on the injected fault is not possible in the case of the software-implemented fault injection technique used (see Section 8.3.2.4).

  7. 7.

    If the error was not detected by the NUT itself, then the node has no error information available and sends only a status message.

  8. 8.

    Due to the small number of NMIs observed, such an analysis was not carried out for SWIFI.

References

  • Aidemark JL, Vinter JP, Folkesson P, Karlsson J (2001) GOOFI: A generic fault injection tool. Proceedings of IEEE/IFIP DSN. Göteborg, Sweden, pp 83–88

    Google Scholar 

  • Arlat, J (1990) Dependability validation by fault injection: method, implementation, application, Doctorat d’Etat Dissertation, INP, Toulouse, France (In French, available from LAAS)

    Google Scholar 

  • Arlat J, Aguera M, Amat L, Crouzet Y, Fabre J-C, Laprie J-C, Martins E, Powell D (1990) Fault injection for dependability validation – a methodology and some applications. IEEE TSE 16(2):166–182

    Google Scholar 

  • Arlat J, Crouzet Y (2002) Faultload representativeness for dependability benchmarking. Suppl. Volume IEEE/IFIP DSN. Washington, DC, USA, pp F.29–F.30

    Google Scholar 

  • Arlat J, Crouzet Y, Karlsson J, Folkesson P, Fuchs E, Leber GH (2003) Comparison of physical and software-implemented fault injection techniques. IEEE TC 52(9):1115–1133

    Google Scholar 

  • Avižienis A, Laprie J-C, Randell B, Landwehr C (2004) Basic concepts and taxonomy of dependable and secure computing. IEEE TDSC 1(1):11–33

    Google Scholar 

  • Baumann R (2005) Soft errors in advanced computer systems. IEEE Des Test Comput 22(3): 258–266

    Article  Google Scholar 

  • Benso A, Prinetto P (eds) (2003) Fault injection techniques and tools for embedded systems reliability evaluation. Frontiers in electronic testing, 23. Kluwer Academic, London, UK, 245p.

    Google Scholar 

  • Bouricius WG, Carter WC, Schneider PR (1969) Reliability modeling techniques for self-repairing computer systems. Proceedings of the 24th ACM National Conference, pp 295–309

    Google Scholar 

  • Crouzet Y, Landrault C (1980) Design of self-checking LSI circuits – application to a 4-bit microprocessor. IEEE TC C-29(6):532–537

    Google Scholar 

  • Crouzet Y (1978) Design of self-checking large scale integration circuits Ph.D Dissertation, INP, Toulouse, France (in French, available from LAAS)

    Google Scholar 

  • Crouzet Y, Galiay J, Landrault C, Rousseau P Vergniault M (1978) Definition and design of easily testable or self-testing LSI circuits Contract Rep. DRET 77/008, LAAS Report 1787, 334 p (in French)

    Google Scholar 

  • Crouzet Y, Waeselynck H, Lussier B, Powell D (2006) The SESAME experience: from assembly languages to declarative models. Proceedings of Mutation 2006, Raleigh, NC, USA

    Google Scholar 

  • Daran M, Thévenod-Fosse P (1996) Software error analysis: a real case study involving real faults and mutations. Proceedings of ISSTA’96, San Diego, CA, USA, 1996, pp 158–171

    Google Scholar 

  • de Andrés D, Ruiz JC, Gil D, Gil P (2008) Fault emulation for dependability evaluation of VLSI systems. IEEE TVLSIS 16(4):422–431

    Google Scholar 

  • Durães J, Madeira H (2006) Emulation of software faults: a field data study and a practical approach. IEEE TSE 32(11):849–867

    Google Scholar 

  • Folkesson PS, Svensson S, Karlsson J (1998) A comparison of simulation based and scan chain implemented fault injection. Proceedings of FTCS. Munich, Germany, pp 284–293

    Google Scholar 

  • Fuchs E (1996) An evaluation of the error detection mechanisms in MARS using software-implemented fault injection. Proceedings of EDCC. Taormina, Italy, pp 73–90

    Google Scholar 

  • Galiay J (1978) Design of easily testable LSI circuits PhD Dissertation, UPS, Toulouse, France (in French, available from LAAS)

    Google Scholar 

  • Galiay J, Crouzet Y, Vergniault M (1980) Physical versus logical faults models in MOS-LSI circuits – impact on their testability. IEEE TC C-29(6):527–531

    Google Scholar 

  • Gil P, Arlat J, Madeira H, Crouzet Y, Jarboui T, Kanoun K, Marteau T, Durães J, Vieira M, Gil D, Baraza JC, Gracia J (2002) Fault representativeness. IST Project DBench, Deliverable ETIE2, 101p.. Available at www.laas.fr/dbench/deliverables.html

  • Hély D, Bancel F, Flottes M-L, Rouzeyre B (2005) Scan design and security: can they work together? Sophia Antipolis MicroElectronics, Sophia Antipolis, France

    Google Scholar 

  • Johansson R (1994) On single event upset error manifestation. Proceedings of EDCC. Berlin, Germany, pp 217–231

    Google Scholar 

  • Kanoun K, Spainhower L (Eds) (2008) Dependability benchmarking for computer systems. IEEE CS Press and Wiley, 362 p.,

    Google Scholar 

  • Kopetz H, Bauer G (2003) The time-triggered architecture. Proceedings of IEEE 91(1):112–126

    Article  Google Scholar 

  • Kopetz H, Holzer P, Leber G, Schindler M (1991) The rolling ball on MARS. Vienna University of Technology, Research Report, No 13/91

    Google Scholar 

  • Leveugle R (2007) Early analysis of fault-based attack effects in secure circuits. IEEE TC 56(10):1431–1434

    MathSciNet  Google Scholar 

  • Moraes R, Barbosa R, Durães J, Mendes N, Martins E, Madeira H (2006) Injection of faults at component interfaces and inside the component code: are they equivalent? Proceedings of EDCC. Coimbra, Portugal, pp 53–64

    Google Scholar 

  • Powell D (1994) Distributed fault-tolerance – lessons from Delta-4. IEEE Micro 14(1):36–47

    Article  Google Scholar 

  • Reisinger J, Steininger A, Leber G (1995) The PDCS implementation of MARS hardware and software. In Predictably dependable computing systems, pp 209–224

    Google Scholar 

  • Rennels DA, Avižienis A, Ercegovac M (1978) A study of standard building blocks for the design of fault-tolerant distributed computer systems Proceedings of IEEE FTCS Toulouse, France, pp 144–149

    Google Scholar 

  • Roth JP, Bouricius WG Schneider PR (1978) Programmed algorithms to compute tests to detect and distinguish between failures in logic circuits IEEE TC C-16(10):567–580

    Google Scholar 

  • Sedmak RS Liebergot HL (1978) Fault-tolerance of a general purpose computer implemented by very large scale integration Proceedings of FTCS Toulouse, France, pp 137–143

    Google Scholar 

  • Stott DT, Ries G, Hsueh M-C, Iyer RK (1998) Dependability analysis of a high-speed network using software-implemented fault injection and simulated fault injection. IEEE TC 47(1): 108–119

    Google Scholar 

  • Wadsack RL (1978) Fault modelling and logic simulation of CMOS and MOS integrated circuits Bell Syst Tech J 57:1449–1473

    MATH  Google Scholar 

Download references

Acknowledgements

The pioneering research reported in Section 8.2 was led by Christian Landrault at LAAS-CNRS. Incidentally, it constitutes his first work on hardware testing, topic on which he has eagerly contributed since then at LIRMM. We are really pleased that we have been given the opportunity to participate in this way to this special book! The authors would like to thank several colleagues and friends from EFCIS (now ST Microelectronics), ESPRIT project PDCS, IST project DBench and from IFIP WG 10.4 on Dependable Computing and Fault Tolerance, for the fruitful exchanges along the years on the various topics addressed in the Chapter. In particular, we are grateful to Alain Costes and Michel Diaz (LAAS-CNRS), and also X. Messonnier, P. Rousseau, and Michel Vergniault (EFCIS) for their helpful comments, suggestions, and assistance for the study reported in Section 8.2. For what concerns Section 8.3, thanks go to Jean-Claude Laprie and Karama Kanoun (LAAS-CNRS), Johan Kalrsson and Peter Folkesson (Chalmers U.), Hermann Kopetz, Günther Leber and Emmerich Fuchs (Vienna UT), for their contributions to the reflections carried out and to the comprehensive comparative study reported. This work was supported in part by DRET, EFCIS, ESPRIT project PDCS, IST project DBench, and IST network of excellence ReSIST. Jacques Galiay, whose contribution to the work on offline testing was essential, sadly deceased in the early 1980s, during a hike in the Alps mountains.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jean Arlat .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer Science+Business Media B.V.

About this chapter

Cite this chapter

Arlat, J., Crouzet, Y. (2010). Physical Fault Models and Fault Tolerance. In: Wunderlich, HJ. (eds) Models in Hardware Testing. Frontiers in Electronic Testing, vol 43. Springer, Dordrecht. https://doi.org/10.1007/978-90-481-3282-9_8

Download citation

  • DOI: https://doi.org/10.1007/978-90-481-3282-9_8

  • Published:

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-90-481-3281-2

  • Online ISBN: 978-90-481-3282-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics