Skip to main content

Non-intrusive System Level Fault-Tolerance

  • Conference paper
Reliable Software Technology – Ada-Europe 2005 (Ada-Europe 2005)

Part of the book series: Lecture Notes in Computer Science ((LNPSE,volume 3555))

Included in the following conference series:

Abstract

High-integrity embedded systems operate in multiple modes, in order to ensure system availability in the face of faults. Unanticipated state-dependent faults that remain in software after system design and development behave like hardware transient faults: they appear, do the damage and disappear. The conventional approach used for handling task overruns caused by transient faults is to use a single recovery task that implements minimal functionality. This approach provides limited availability and should be used as a last resort in order to keep the system online. Traditional fault detection approaches are often intrusive in that they consume processor resources in order to monitor system behavior. This paper presents a novel approach for fault-monitoring by leveraging the Ravenscar profile, model-checking and a system-on-chip implementation of both the kernel and an execution time monitor. System fault-tolerance is provided through a hierarchical set of operational modes that are based on timing behavior violations of individual tasks within the application. The approach is illustrated through a simple case study of a generic navigation system.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Alur, R., Dill, D.L.: Automata for modeling real-time systems. In: Paterson, M. (ed.) ICALP 1990. LNCS, vol. 443, pp. 322–335. Springer, Heidelberg (1990)

    Chapter  Google Scholar 

  2. Burns, A.: The Ravenscar Profile. ACM Ada Letters XIX(4), 49–52 (1999)

    Article  Google Scholar 

  3. Burns, A.: How to Verify a Safe Real-Time System: The Application of Model Checking and Timed Automata to the Production Cell Case Study. Real-Time Systems 24, 135–151 (2003)

    Article  MATH  Google Scholar 

  4. Burns, A., Dobbing, B., Vardanega, T.: Guide for the Use of the Ada Ravenscar Profile in High Integrity Systems. University of York Technical Report YCS-2003-348 (2003)

    Google Scholar 

  5. de la Puente, J.A., Zamorano, J.: Execution-Time Clocks and Ravenscar Kernels. Ada Letters XXIII(4) (December 2003)

    Google Scholar 

  6. Behrmann, G., David, A., Larsen, K.G.: A tutorial on uppaal. In: Bernardo, M., Corradini, F. (eds.) SFM-RT 2004. LNCS, vol. 3185, pp. 200–236. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  7. Harbour, M.G., Rivas, M.A., et al.: Implementing and using execution time clocks in ada hard real-time applications. In: Asplund, L. (ed.) Ada-Europe 1998. LNCS, vol. 1411, pp. 90–101. Springer, Heidelberg (1998)

    Chapter  Google Scholar 

  8. Hopcroft, J.E., Ullman, J.D.: Introduction of Automata Theory, Languages, and Computation. Addison Wesley, Reading (2001)

    Google Scholar 

  9. ISO/IEC Ada 95 Reference Manual, Language and Standard Libraries, Version 6.0

    Google Scholar 

  10. Lee, P.A., Anderson, T.: Fault Tolerance: Principles and Practice (Second Revised edn.). Springer, New York

    Google Scholar 

  11. Liu, C.L., Layland, J.W.: Scheduling Algorithms for Multiprogramming in a Hard-Real-Time Environment. Journal of the ACM (JACM) 20(1), 46–61 (1973)

    Article  MATH  MathSciNet  Google Scholar 

  12. Leung, J.Y.T., Whitehead, J.: On the complexity of fixed-priority scheduling of periodic real-time tasks. Performance Evaluation 2(4), 237–250 (1982)

    Article  MATH  MathSciNet  Google Scholar 

  13. Pettersson, P., Larsen, K.G.: UPPAAL2k. In: Bulletin of the European Association for Theoretical Computer Science, vol. 70, pp. 40–44 (2000)

    Google Scholar 

  14. Real, J., Crespo, A.: Mode Change Protocols for Real-Time Systems: A Survey and a New Proposal. Real-Time Systems 4, 161–197 (2004)

    Article  Google Scholar 

  15. Ram Murthy, C.S., Manimaran, G.: Resource Management in Real-Time Systems and Networks. The MIT Press, Cambridge (2001)

    Google Scholar 

  16. Silbovitz, A.: RavenHaRT- A Hardware Implementation of a Ravenscar Compliant Kernel, SM Thesis, Aeronautics and Astronautics, MIT (2003)

    Google Scholar 

  17. Torres-Pomales, W.: Software Fault-Tolerance: A Tutorial, NASA Technical Report, NASA-2000-tm210616 (2000)

    Google Scholar 

  18. Virtex-II Pro Platform FPGA Handbook, v1.0 (2002), http://www.xilinx.com

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lundqvist, K., Srinivasan, J., Gorelov, S. (2005). Non-intrusive System Level Fault-Tolerance. In: Vardanega, T., Wellings, A. (eds) Reliable Software Technology – Ada-Europe 2005. Ada-Europe 2005. Lecture Notes in Computer Science, vol 3555. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11499909_13

Download citation

  • DOI: https://doi.org/10.1007/11499909_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-26286-2

  • Online ISBN: 978-3-540-31666-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics