Effectiveness of hardware-based stride and sequential prefetching in shared-memory multiprocessors
- 19 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 12, 68-77
- https://doi.org/10.1109/hpca.1995.386554
Abstract
We study the relative efficiency of previously proposed stride and sequential prefetching, two promising hardware-based prefetching schemes for reducing read-miss penalties in shared-memory multiprocessors. Although stride accesses dominate in four of the six applications we study, we find that sequential prefetching does better than stride prefetching for three applications. This is because (i) most strides are shorter than the block size (we assume 32-byte blocks), which means that sequential prefetching is as effective for stride accesses, and (ii) sequential prefetching also exploits the locality of read misses for non-stride accesses. However, we find that since stride prefetching causes fewer useless prefetches, it consumes less memory-system bandwidth.
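Point (i) of the abstract can be illustrated with a toy model. The sketch below is not the paper's simulator: it assumes a single processor, an infinitely large cache, prefetches that complete instantly, and a sequential prefetcher that fetches the next `degree` blocks on every read miss. The function names and the `degree` parameter are hypothetical, chosen only for this illustration; the 32-byte block size is the one the abstract assumes.

```python
BLOCK_SIZE = 32  # bytes, the block size assumed in the abstract


def block_of(addr):
    """Return the index of the cache block that holds byte address `addr`."""
    return addr // BLOCK_SIZE


def no_prefetch_misses(addresses):
    """Count read misses with no prefetching (infinite cache, cold start)."""
    cached = set()
    misses = 0
    for addr in addresses:
        b = block_of(addr)
        if b not in cached:
            misses += 1
            cached.add(b)
    return misses


def sequential_misses(addresses, degree=1):
    """Count read misses when each miss also fetches the next `degree` blocks.

    Assumption for this sketch: prefetched blocks arrive before they are
    needed, so any access to a prefetched block counts as a hit.
    """
    cached = set()
    misses = 0
    for addr in addresses:
        b = block_of(addr)
        if b not in cached:
            misses += 1
            cached.add(b)
            for k in range(1, degree + 1):
                cached.add(b + k)  # sequential prefetch of following blocks
    return misses
```

Under these assumptions, 64 reads with an 8-byte stride (`range(0, 512, 8)`) touch 16 consecutive blocks, and sequential prefetching with `degree=1` halves the misses from 16 to 8 without any stride detection. With a 128-byte stride (`range(0, 8192, 128)`) every read lands four blocks past the previous one, so the next-block prefetch is never used and all 64 reads still miss; this is the case in which a stride prefetcher would be expected to help, and the unused prefetches are an example of the extra bandwidth the abstract mentions.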