On balancing the load in a clustered web farm
- 1 November 2001
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Internet Technology
- Vol. 1 (2) , 231-261
- https://doi.org/10.1145/502152.502155
Abstract
In this article we propose a novel, yet practical, scheme which attempts to optimally balance the load on the servers of a clustered Web farm. The goal in solving this performance problem is to achieve minimal average response time for customer requests, and thus ultimately achieve maximal customer throughput. The article decouples the overall problem into two related but distinct mathematical subproblems, one static and one dynamic. We believe this natural decoupling is one of the major contributions of our article. The static component algorithm determines good assignments of sites to potentially overlapping servers. These cluster assignments, which, due to overhead, cannot be changed too frequently, have a major effect on achievable response time. Additionally, these assignments must be palatable to the sites themselves. The dynamic component algorithm is designed to handle real-time load balancing by routing customer requests from the network dispatcher to the servers. This algorithm must react to fluctuating customer request load while respecting the assignments of sites to servers determined by the static component. The static and dynamic components both employ in various contexts the same so-called goal setting algorithm. This algorithm determines the theoretically optimal load on each server, given hypothetical cluster assignments and site activity. We demonstrate the effectiveness of the overall load-balancing scheme via a number of simulation experiments.Keywords
This publication has 11 references indexed in Scilit:
- Dynamic load balancing on Web-server systemsIEEE Internet Computing, 1999
- Analysis and characterization of large‐scale Web server access patterns and performanceWorld Wide Web, 1999
- Approximation Algorithms for NP-Hard ProblemsACM SIGACT News, 1997
- Tabu SearchPublished by Springer Nature ,1997
- Optimal partitioning of cache memoryIEEE Transactions on Computers, 1992
- Simulated Annealing: Theory and ApplicationsPublished by Springer Nature ,1987
- The Greedy Procedure for Resource Allocation Problems: Necessary and Sufficient Conditions for OptimalityOperations Research, 1986
- Throughput concavity and response time convexityInformation Processing Letters, 1984
- The complexity of selection and ranking in X + Y and matrices with sorted columnsJournal of Computer and System Sciences, 1982
- A Fast Selection Algorithm and the Problem of Optimum Distribution of EffortJournal of the ACM, 1979