Abstract
We present an architecture and prototype implementation of a performance management system for cluster-based web services. The system supports multiple classes of web services traffic and allocates server resources dynamically so to maximize the expected value of a given cluster utility function in the face of fluctuating loads. The cluster utility is a function of the performance delivered to the various classes, and this leads to differentiated service. In this paper we will use the average response time as the performance metric. The management system is transparent: it requires no changes in the client code, the server code, or the network interface between them. The system performs three performance management tasks: resource allocation, load balancing, and server overload protection. We use two nested levels of management mechanism. The inner level centers on queuing and scheduling of request messages. The outer level is a feedback control loop that periodically adjusts the scheduling weights and server allocations of the inner level. The feedback controller is based on an approximate first-principles model of the system, with parameters derived from continuous monitoring. We focus on SOAP-based web services. We report experimental results that show the dynamic behavior of the system.
The original version of this chapter was revised: The copyright line was incorrect. This has been corrected. The Erratum to this chapter is available at DOI: 10.1007/978-0-387-35674-7_66
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
T. Abdelzaher, K. Shin, and N. Bhatti, “Performance Guarantees for Web Server End-Systems: A Control-Theoretical Approach”, IEEE Transactions on Parallel and Distributed Systems, Vol. 13, No. 1, Jan 2002.
S. Aissi, P. Malu, and K. Srinivasan, “E-Business Process Modeling: The Next Big Step”, IEEE Computer 35 (5), pp 55–62, May 2002.
Apache XML Project, http://xml.apache.org/axis/
K. Appleby, S. Fakhouri, L. Fong, G. Goldszmidt, M. Kalantar, S. Krishnakumar, DP. Pazel, J. Pershing, and B. Rochwerger, “Oceano SLA based management of a computing utility”, Proceedings of 2001 International Symposium on Integrated Network Management, Page 14–18. May 2001.
M. Aron, P. Druschel, and W. Zwaenepoel, “Cluster Reserves: A Mechanism for Resource Management in Cluster-based Network Servers”, ACM Sigmetrics 2000, Santa Clara, CA, Jun 2000.
G. Banga, J. Mogul, and P. Druschel, “Resource containers: A new facility for resource management in server systems”, Proceedings of the Third Symposium on Operating Systems Design and Implementation (OSDI’99), New Orleans, LA, Feb 1999.
J. Carlström, and R. Rom, “Application-aware Admission Control and Scheduling in Web Servers”, IEEE INFOCOM 2002, New York, NY, Jun 2002.
J. Chase, D. Anderson, P. Thakar, A. Vandat, and R. Doyle, “Managing Energy and Server Resources in Hosting Centers”, Proceedings of 18 th ACM Symposium on Operating System Principles, pages 103–116, Oct 2001.
H. Chen and P. Mohapatra, “Session-Based Overload Control in QoS-Aware Web Servers”, IEEE INFOCOM 2002, New York, NY, Jun 2002.
Y. Diao, N. Gandhi, J. L. Hellerstein, S. Parekh, and D. M. Tilbury, “Using MIMO Feedback Control to Enforce Policies for Interrelated Metrics With Application to the ApacheWeb Server”, Proc. NOMS 2002, 219–234, Apr 15–19, 2002, Florence, Italy.
L. Kleinrock, Queueing Systems — Volume 1: Theory, John Wiley, 1975.
R. Levy, J. Nagarajarao, G. Pacifici, M. Spreitzer, A. Tantawi, and A. Youssef, “Performance Management For Cluster Based Web Services”, IBM Research Technical Report, RC22676, Dec 2002.
S. H. Low and D. E. Lapsley, “Optimization Flow Control I: basic Algorithm and Convergence”, IEEE/A CM Transactions on Networking, Vol. 7, No. 6, Dec 1999.
P. Marbach, “Priority Service and Max-Min Fairness”, IEEE INFOCOM 2002, New York, NY, Jun 2002.
D. Schmidt, “Middleware for Real-Time and Embedded Systems”, Communications of the ACM, Vol. 45, No. 6, Jun 2002.
Sun Microsystems, Java Messaging Service API,http://java.sun.com/products/jms/
S.J. Vaughan-Nichols, “Web Services: Beyond the Hype”, IEEE Computer, Feb 2002.
T. Voigt, R. Tewari, D. Freimuth, and A. Mehra, “Kernel Mechanisms for Service Differentiation in Overloaded Web Servers”, In Proceedings of the 2001 USENIX Annual Technical Conference, Boston, MA,Jun 2001.
T. Zhao and V. Karamcheti, “Enforcing Resource Sharing Agreements among Distributed Server Clusters”, Proceedings International Parallel and Distributed Processing Symposium, IPDPS 2002, Ft. Lauderdale, FL, Apr 2002, pp. 501–510.
H. Zhu, H. Tang, and T. Yang, “Demand-driven Service Differentiation in Cluster-based Network Servers”, IEEE INFOCOM 2001, Anchorage, Alaska, Apr 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 IFIP International Federation for Information Processing
About this chapter
Cite this chapter
Levy, R., Nagarajarao, J., Pacifici, G., Spreitzer, M., Tantawi, A., Youssef, A. (2003). Performance Management for Cluster Based Web Services. In: Goldszmidt, G., Schönwälder, J. (eds) Integrated Network Management VIII. IM 2003. IFIP — The International Federation for Information Processing, vol 118. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-35674-7_29
Download citation
DOI: https://doi.org/10.1007/978-0-387-35674-7_29
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4757-5521-3
Online ISBN: 978-0-387-35674-7
eBook Packages: Springer Book Archive