Size-based scheduling to improve web performance
Top Cited Papers
- 1 May 2003
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Computer Systems
- Vol. 21 (2) , 207-233
- https://doi.org/10.1145/762483.762486
Abstract
Is it possible to reduce the expected response time of every request at a web server, simply by changing the order in which we schedule the requests? That is the question we ask in this paper.This paper proposes a method for improving the performance of web servers servicing static HTTP requests. The idea is to give preference to requests for small files or requests with short remaining file size, in accordance with the SRPT (Shortest Remaining Processing Time) scheduling policy.The implementation is at the kernel level and involves controlling the order in which socket buffers are drained into the network. Experiments are executed both in a LAN and a WAN environment. We use the Linux operating system and the Apache and Flash web servers.Results indicate that SRPT-based scheduling of connections yields significant reductions in delay at the web server. These result in a substantial reduction in mean response time and mean slowdown for both the LAN and WAN environments. Significantly, and counter to intuition, the requests for large files are only negligibly penalized or not at all penalized as a result of SRPT-based scheduling.Keywords
This publication has 9 references indexed in Scilit:
- Workload characterization of a Web proxy in a cable modem environmentACM SIGMETRICS Performance Evaluation Review, 1999
- Measuring the capacity of a Web server under realistic loadsWorld Wide Web, 1999
- Better operating system features for faster network serversACM SIGMETRICS Performance Evaluation Review, 1998
- Self-similarity in World Wide Web traffic: evidence and possible causesIEEE/ACM Transactions on Networking, 1997
- Exploiting process lifetime distributions for dynamic load balancingACM Transactions on Computer Systems, 1997
- DummynetACM SIGCOMM Computer Communication Review, 1997
- Improving HTTP latencyComputer Networks and ISDN Systems, 1995
- The Queue M/G/1 with the Shortest Remaining Processing Time DisciplineOperations Research, 1966
- A Proof for the Queuing Formula: L = λWOperations Research, 1961