A data mining algorithm for generalized web prefetching
- 29 September 2003
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Knowledge and Data Engineering
- Vol. 15 (5) , 1155-1169
- https://doi.org/10.1109/tkde.2003.1232270
Abstract
Predictive Web prefetching refers to the mechanism of deducing the forthcoming page accesses of a client based on its past accesses. In this paper, we present a new context for the interpretation of Web prefetching algorithms as Markov predictors. We identify the factors that affect the performance of Web prefetching algorithms. We propose a new algorithm called WM,,, which is based on data mining and is proven to be a generalization of existing ones. It was designed to address their specific limitations and its characteristics include all the above factors. It compares favorably with previously proposed algorithms. Further, the algorithm efficiently addresses the increased number of candidates. We present a detailed performance evaluation of WM, with synthetic and real data. The experimental results show that WM/sub o/ can provide significant improvements over previously proposed Web prefetching algorithms.Keywords
This publication has 26 references indexed in Scilit:
- Characterizing reference locality in the WWWPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Speculative data dissemination and service to reduce server load, network traffic and service time in distributed information systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Mining sequential patternsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Rule-assisted prefetching in Web-server cachingPublished by Association for Computing Machinery (ACM) ,2000
- Declarative specification of Web sites with SThe VLDB Journal, 2000
- Caching on the World Wide WebIEEE Transactions on Knowledge and Data Engineering, 1999
- WebCompanion: a friendly client-side Web prefetching agentIEEE Transactions on Knowledge and Data Engineering, 1999
- Generating representative Web workloads for network and server performance evaluationACM SIGMETRICS Performance Evaluation Review, 1998
- Strong Regularities in World Wide Web SurfingScience, 1998
- Levelwise Search and Borders of Theories in Knowledge DiscoveryData Mining and Knowledge Discovery, 1997