Skip to main content
Log in

Analysis of field data on computer failures

  • Regular Papers
  • Published:
Journal of Computer Science and Technology Aims and scope Submit manuscript

Abstract

This paper emphasizes the importance of making field measurements for effective and realistic dependability evaluations. Two examples are given, both based on real data from IBM mainframes. The first evaluates the impact of the operating environment on system failure characteristics and the second shows how an accurate model depicting this interaction can be extracted from real data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Arlat, J. and Laprie, J. C., Performance-Related Dependability Evaluation of Supercomputer Systems, inIEEE FTCS-13, Milano, Italy, June 28–30, 1983.

  2. Meyer, J.F., Closed-form solutions of performability,IEEE Transactions on Computers, C-31: 7 (1982), 648–657.

    Article  Google Scholar 

  3. Castillo, X., A Compatible Hardware/Software Reliability Prediction Model. PhD Thesis, Carnegie-Mellon University, July, 1981.

  4. Iyer, R. K. and Rossetti, D.J., A Statistical Load Dependency Model for CPU Errors at SLAC, InProceedings of the 12th International Symposium Fault-Tolerant Computing, Santa Monica, California, 363–372, June 1982.

  5. Hsueh, M. C., Measurement-Based Reliability/Performability Models. Ph.D. Dissertation, Computer Science Department, University of Illinois at Urbana-Champaign, August 1987.

  6. Iyer, R.K., Butner, S.E., and McCluskey, E. J., A statistical failure/load relationship: results of a multicomputer study,IEEE Transactions on Computer, G-31: 7 (1982), 697–706.

    Article  Google Scholar 

  7. Iyer, R.K. and Rossetti, D.J., Effect of system workload on operating system reliability: a study on IBM 3081,IEEE Transactions on Sof tware Engineering, SE-11: 12 (1985), 1438–1448.

    Article  Google Scholar 

  8. Iyer, R.K., Rossetti, D.J., and Hsueh, M.C., Measurement and modeling of computer reliability as affected by system activity,ACM Transactions on Computer Systems,4: 3 (1986), 214–237.

    Article  Google Scholar 

  9. Hsueh, M. C., Iyer, R.K., and Trivedi, K.S., Performability modeling based on real data: a case study,IEEE Transactions on Computers, C-37: 4 (1988).

    Google Scholar 

  10. Ferrari, D., Serazzi, G., and Zeigner, A., Measurement and Tuning of Computer Systems, Englewood Cliffs, NJ: Prentice-Hall, Inc., 1981.

    Google Scholar 

  11. Spath, H, Cluster Analysis Algorithms, West Sussex, England: Ellis Horwood Ltd., 1980.

    Google Scholar 

  12. Trivedi, K.S., Probability & Statistics with Reliability, Queuing, and Computer Science Applications, Englewood Cliffs, N.J.: Prentice-Hall, Inc., 1982.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

About this article

Cite this article

Iyer, R.K., Hsueh, MC. Analysis of field data on computer failures. J. of Compt. Sci. & Technol. 5, 99–108 (1990). https://doi.org/10.1007/BF02943416

Download citation

  • Received:

  • Revised:

  • Issue Date:

  • DOI: https://doi.org/10.1007/BF02943416

Keywords

Navigation