Efficient algorithms for predicting requests to Web servers

1 January 1999

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1, 284-293 vol.1
https://doi.org/10.1109/infcom.1999.749294

Abstract

Internet traffic has grown significantly with the popularity of the Web. Consequently user perceived latency in retrieving Web pages has increased. Caching and prefetching at the client side, aided by hints from the server, are attempts at solving this problem. We suggest techniques to group resources that are likely to be accessed together into volumes, which are used to generate hints tailored to individual applications, such as prefetching, cache replacement, and cache validation. We discuss theoretical aspects of optimal volume construction, and develop efficient heuristics. Tunable parameters allow our algorithms to predict as many accesses as possible while reducing false predictions and limiting the size of hints. We analyze a collection of large server logs, extracting access patterns to construct and evaluate volumes. We examine sampling techniques to process only portions of the server logs while constructing equally good volumes. We show that it is possible to predict requests at low cost with a high degree of precision.

Keywords

This publication has 8 references indexed in Scilit:

The network effects of prefetching
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Improving end-to-end performance of the Web using server volumes and proxy filters
ACM SIGCOMM Computer Communication Review, 1998
Piggyback server invalidation for proxy cache coherency
Computer Networks and ISDN Systems, 1998
An adaptive network prefetch scheme
IEEE Journal on Selected Areas in Communications, 1998
Hinted caching in the web
Published by Association for Computing Machinery (ACM) ,1996
Using predictive prefetching to improve World Wide Web latency
ACM SIGCOMM Computer Communication Review, 1996
Using speculation to reduce server load and service time on the WWW
Published by Association for Computing Machinery (ACM) ,1995
A Greedy Heuristic for the Set-Covering Problem
Mathematics of Operations Research, 1979