Advertisement

Non-Markovian Modeling of a BladeCenter Chassis Midplane

  • Salvatore Distefano
  • Francesco Longo
  • Marco Scarpa
  • Kishor S. Trivedi
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8721)

Abstract

In distributed contexts such as Cloud computing, the reliability and availability of the provided resources and services have to be assured in order to meet user requirements. At the infrastructure level, this specification is translated into tighter ones on the datacenter hosting physical resources. In this paper, starting from a real case study of the IBM BladeCenter, we provide a technique for the quantitative evaluation of datacenter infrastructure availability. The proposed technique allows one to take into account both aging phenomena and multiple operating conditions. In particular, one subsystem of the BladeCenter, the chassis midplane, is studied. Indeed, based on the stochastic characterization of the midplane reliability through statistic measurements, a model dealing with the non-exponential failure time distribution thus obtained is evaluated to demonstrate the suitability and the effectiveness of the proposed technique.

Keywords

Weibull Distribution State Space Model Load Sharing Continuous Time Markov Chain Power Domain 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Resnick, R.I.: A Modern Taxonomy of High Availability. Technical report, Interlog (1996), http://www.generalconcepts.com/resources/reliability/resnick/HA.htm
  2. 2.
    Cisco Systems Inc.: Data Center High Availability Clusters Design Guide (2006), http://www.ciscocertified.info/en/US/docs/solutions/Enterprise/Data_Center/HA/_Clusters/HA/_Clusters.html
  3. 3.
    Ghosh, R., Longo, F., Frattini, F., Russo, S., Trivedi, K.: Scalable analytics for iaas cloud availability. IEEE Transactions on Cloud Computing 2(1), 57–70 (2014)CrossRefGoogle Scholar
  4. 4.
    Zheng, J., Okamura, H., Dohi, T.: Component importance analysis of virtualized system. In: 2012 9th International Conference on Ubiquitous Intelligence Computing and 9th International Conference on Autonomic Trusted Computing (UIC/ATC), pp. 462–469 (September 2012)Google Scholar
  5. 5.
    Muppala, J., Malhotra, M., Trivedi, K.: Markov dependability models of complex systems: Analysis techniques. In: Özekici, S. (ed.) Reliability and Maintenance of Complex Systems. NATO ASI Series, vol. 154, pp. 442–486. Springer, Heidelberg (1996)CrossRefGoogle Scholar
  6. 6.
    Fricks, R., Yin, L., Trivedi, K.: Application of semi-markov process and CTMC to evaluation of UPS system availability. In: RAMS 2002, January 28–31, pp. 584–591 (2002)Google Scholar
  7. 7.
    Mura, I., Bondavalli, A.: Markov regenerative stochastic petri nets to model and evaluate phased mission systems dependability. IEEE Transactions on Computers 50(12), 1337–1351 (2001)CrossRefGoogle Scholar
  8. 8.
    Longo, F., Scarpa, M.: Applying symbolic techniques to the representation of non-markovian models with continuous PH distributions. In: Bradley, J.T. (ed.) EPEW 2009. LNCS, vol. 5652, pp. 44–58. Springer, Heidelberg (2009); Cited by (since 1996)6Google Scholar
  9. 9.
    Longo, F., Scarpa, M.: Two-layer symbolic representation for stochastic models with phase-type distributed events. International Journal of Systems Science, 1–32Google Scholar
  10. 10.
    Pérez-Ocón, R., Castro, J.E.R.: Two models for a repairable two-system with phase-type sojourn time distributions. Reliability Engineering & System Safety 84(3), 253–260 (2004)CrossRefGoogle Scholar
  11. 11.
    Smith, W.E., Trivedi, K., Tomek, L., Ackaret, J.: Availability analysis of blade server systems. IBM Systems Journal 47(4), 621–640 (2008)CrossRefGoogle Scholar
  12. 12.
    Bruneo, D., Distefano, S., Longo, F., Puliafito, A., Scarpa, M.: Reliability assessment of wireless sensor nodes with non-linear battery discharge (2010); Cited by (since 1996)5Google Scholar
  13. 13.
    Bruneo, D., Distefano, S., Longo, F., Puliafito, A., Scarpa, M.: Evaluating wireless sensor node longevity through markovian techniques. Comput. Netw. 56(2), 521–532 (2012)CrossRefGoogle Scholar
  14. 14.
    Sedyakin, N.: On one physical principle in reliability theory. Tekhn. Kibernetika (in Russian - Technical Cybernetics) 3, 80–87 (1966)Google Scholar

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Salvatore Distefano
    • 1
  • Francesco Longo
    • 2
  • Marco Scarpa
    • 2
  • Kishor S. Trivedi
    • 3
  1. 1.Dipartimento di Elettronica, Informazione e BioingegneriaPolitecnico di MilanoMilanoItaly
  2. 2.Dipartimento di Ingegneria DICIEAMAUniversità di MessinaMessinaItaly
  3. 3.Department of Electrical and Computer EngineeringDuke UniversityDurhamUSA

Personalised recommendations