Efficient algorithms for predicting requests to Web servers
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 284-293 vol.1
- https://doi.org/10.1109/infcom.1999.749294
Abstract
Internet traffic has grown significantly with the popularity of the Web. Consequently user perceived latency in retrieving Web pages has increased. Caching and prefetching at the client side, aided by hints from the server, are attempts at solving this problem. We suggest techniques to group resources that are likely to be accessed together into volumes, which are used to generate hints tailored to individual applications, such as prefetching, cache replacement, and cache validation. We discuss theoretical aspects of optimal volume construction, and develop efficient heuristics. Tunable parameters allow our algorithms to predict as many accesses as possible while reducing false predictions and limiting the size of hints. We analyze a collection of large server logs, extracting access patterns to construct and evaluate volumes. We examine sampling techniques to process only portions of the server logs while constructing equally good volumes. We show that it is possible to predict requests at low cost with a high degree of precision.Keywords
This publication has 8 references indexed in Scilit:
- The network effects of prefetchingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Improving end-to-end performance of the Web using server volumes and proxy filtersACM SIGCOMM Computer Communication Review, 1998
- Piggyback server invalidation for proxy cache coherencyComputer Networks and ISDN Systems, 1998
- An adaptive network prefetch schemeIEEE Journal on Selected Areas in Communications, 1998
- Hinted caching in the webPublished by Association for Computing Machinery (ACM) ,1996
- Using predictive prefetching to improve World Wide Web latencyACM SIGCOMM Computer Communication Review, 1996
- Using speculation to reduce server load and service time on the WWWPublished by Association for Computing Machinery (ACM) ,1995
- A Greedy Heuristic for the Set-Covering ProblemMathematics of Operations Research, 1979