The content and access dynamics of a busy Web site
- 28 August 2000
- conference paper
- Published by Association for Computing Machinery (ACM)
- Vol. 30 (4) , 111-123
- https://doi.org/10.1145/347059.347413
Abstract
In this paper, we study the dynamics of the MSNBC news site, one of the busiest Web sites in the Internet today. Unlike many other efforts that have analyzed client accesses as seen by proxies, we focus on the server end. We analyze the dynamics of both the server content and client accesses made to the server. The former considers the content creation and modification process while the latter considers page popularity and locality in client accesses. Some of our key results are: (a) files tend to change little when they are modified, (b) a small set of files tends to get modified repeatedly, (c) file popularity follows a Zipf-like distribution with a parameter &agr that is much larger than reported in previous, proxy-based studies, and (d) there is significant temporal stability in file popularity but not much stability in the domains from which clients access the popular content. We discuss the implications of these findings for techniques such as Web caching (including cache consistency algorithms), and prefetching or server-based ``push'' of Web content.Keywords
This publication has 9 references indexed in Scilit:
- On the scale and performance of cooperative Web proxy cachingPublished by Association for Computing Machinery (ACM) ,1999
- A scalable Web cache consistency architecturePublished by Association for Computing Machinery (ACM) ,1999
- Web prefetching between low-bandwidth clients and proxiesPublished by Association for Computing Machinery (ACM) ,1999
- Web caching and Zipf-like distributions: evidence and implicationsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999
- Improving end-to-end performance of the Web using server volumes and proxy filtersPublished by Association for Computing Machinery (ACM) ,1998
- Delta algorithmsACM Transactions on Software Engineering and Methodology, 1998
- Potential benefits of delta encoding and data compression for HTTPPublished by Association for Computing Machinery (ACM) ,1997
- Using predictive prefetching to improve World Wide Web latencyACM SIGCOMM Computer Communication Review, 1996
- Web server workload characterizationPublished by Association for Computing Machinery (ACM) ,1996