Cluster Computing

, Volume 17, Issue 3, pp 979–999 | Cite as

Towards autonomic performance management of large scale data centers using interaction balance principle



In this paper, an autonomic performance management approach is introduced that can be applied to a general class of web services deployed in large scale distributed environment. The proposed approach utilizes traditional large scale control-based algorithms by using interaction balance approach in web service environment for managing the response time and the system level power consumption. This approach is developed in a generic fashion that makes it suitable for web service deployments, where web service performance can be adjusted by using a finite set of control inputs. This approach maintains the service level agreements, maximizes the revenue, and minimizes the infrastructure operating cost. Additionally, the proposed approach is fault-tolerant with respect to the failures of the computing nodes inside the distributed deployment. Moreover, the computational overhead of the proposed approach can also be managed by using appropriate value of configuration parameters during its deployment.


Autonomic computing Interaction balance Distributed control 



This research was made possible by NPRP grant # NPRP 09-778-2299 from the Qatar National Research Fund (a member of Qatar Foundation). The statements made herein are solely the responsibility of the authors.


  1. 1.
    Abdelwahed, S., Hassan, M., Sultan, M.: Parallel asynchronous algorithms for optimal control of large-scale dynamic systems. Optim. Control Appl. Methods 18 (1997) Google Scholar
  2. 2.
    Abdelwahed, S., Bai, J., Su, R., Kandasamy, N.: On the application of predictive control techniques for adaptive performance management of computing systems. IEEE Trans. Netw. Serv. Manag. 6(4), 212–225 (2009). doi: 10.1109/TNSM.2009.04.090402 CrossRefGoogle Scholar
  3. 3.
    Arlitt, M., Jin, T.: Workload characterization of the 1998 world cup web site. Technical Report HPL-99-35R1, Hewlett-Packard Labs (1999) Google Scholar
  4. 4.
    DeLurgio, S.A.: Forecasting Principles and Applications. McGraw-Hill, New York (1998) Google Scholar
  5. 5.
    Dunbar, W.B., Murray, R.M.: Distributed receding horizon control for multi-vehicle formation stabilization. Automatica 42(4), 549–558 (2006). doi: 10.1016/j.automatica.2005.12.008, CrossRefMATHMathSciNetGoogle Scholar
  6. 6.
    Edwin, G., Cox, M.T.: In: Comas: Coordination in Multiagent Systems (2001) Google Scholar
  7. 7.
    Hou, Z.G.: A hierarchical optimization neural network for large-scale dynamic systems. Automatica 37(12), 1931–1940 (2001). doi: 10.1016/S0005-1098(01)00158-3, CrossRefMATHMathSciNetGoogle Scholar
  8. 8.
    Kalman, R.E.: A new approach to linear filtering and prediction problems. Trans. ASME Ser. D, J. Basic. Eng. 82, 35–45 (1960). CrossRefGoogle Scholar
  9. 9.
    Kandasamy, N., Abdelwahed, S., Khandekar, M.: A hierarchical optimization framework for autonomic performance management of distributed computing systems. In: Proc. 26th IEEE Int’l Conf. Distributed Computing Systems (ICDCS) (2006) Google Scholar
  10. 10.
    Keviczky, T., Borrelli, F., Balas, G.: Hierarchical design of decentralized receding horizon controllers for decoupled systems. In: 43rd IEEE Conference on Decision and Control, 2004, CDC, vol. 2, pp. 1592–1597 (2004). doi: 10.1109/CDC.2004.1430271 Google Scholar
  11. 11.
    Keviczky, T., Borrelli, F., Fregene, K., Godbole, D., Balas, G.: Decentralized receding horizon control and coordination of autonomous vehicle formations. IEEE Trans. Control Syst. Technol. 16(1), 19–33 (2008). doi: 10.1109/TCST.2007.903066 CrossRefGoogle Scholar
  12. 12.
    Kusic, D., Kandasamy, N., Jiang, G.: Approximation modeling for the online performance management of distributed computing systems. In: ICAC’07: Proceedings of the Fourth International Conference on Autonomic Computing, p. 23 (2007). doi: 10.1109/ICAC.2007.8 Google Scholar
  13. 13.
    Mehrotra, R., Dubey, A., Abdelwahed, S., Tantawi, A.: A Power-Aware Modeling and Autonomic Management Framework for Distributed Computing Systems. CRC Press, Boca Raton (2011) Google Scholar
  14. 14.
    Mehrotra, R., Dubey, A., Abdelwahed, S., Rowland, K.W.: Rfdmon: a real-time and fault-tolerant distributed system monitoring approach. In: The Eighth International Conference on Autonomic and Autonomous Systems. St. Maarten, Netherlands Antilles (2012) Google Scholar
  15. 15.
    Sadati, N.: A novel approach to coordination of large-scale systems; part I interaction prediction principle. In: IEEE International Conference on Industrial Technology, 2005, ICIT, pp. 641–647 (2005). doi: 10.1109/ICIT.2005.1600716 Google Scholar
  16. 16.
    Sadati, N.: A novel approach to coordination of large-scale systems; part II interaction balance principle. In: IEEE International Conference on Industrial Technology, 2005, ICIT, pp. 648–654 (2005). doi: 10.1109/ICIT.2005.1600717 Google Scholar
  17. 17.
    Sadati, N., Dumont, G.A.: A reinforcement learning approach to intelligent goal coordination of two-level large-scale control systems. In: Mellouk, P.A. (ed.) Advances in Reinforce- ment Learning.
  18. 18.
    Titli, M.G.S.A.: Systems: Decomposition, Optimisation, and Control. Pergamon, Elmsford (1978) MATHGoogle Scholar
  19. 19.
    Wang, M., Kandasamy, N., Guez, A., Kam, M.: Adaptive performance control of computing systems via distributed cooperative control: application to power management in computing clusters. In: IEEE International Conference on Autonomic Computing, 2006, ICAC, pp. 165–174 (2006). doi: 10.1109/ICAC.2006.1662395 Google Scholar
  20. 20.
    Wang, C., Ong, C.J., Sim, M.: Distributed model predictive control of dynamically decoupled linear systems with coupled cost. In: Proceedings of the 48th IEEE Conference on Decision and Control, 2009 Held Jointly with the 2009 28th Chinese Control Conference, CDC/CCC, 2009, pp. 5420–5425 (2009). doi: 10.1109/CDC.2009.5400521 Google Scholar
  21. 21.
    Xu, J., Zhao, M., Fortes, J.A.: Cooperative autonomic management in dynamic distributed systems. In: Proceedings of the 11th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS’09, pp. 756–770. Springer, Berlin (2009). doi: 10.1007/978-3-642-05118-0_52 CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media New York 2013

Authors and Affiliations

  1. 1.Department of Electrical and Computer EngineeringMississippi State UniversityOktibbehaUSA

Personalised recommendations