Tile size selection using cache organization and data layout
- 1 June 1995
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGPLAN Notices
- Vol. 30 (6) , 279-290
- https://doi.org/10.1145/223428.207162
Abstract
When dense matrix computations are too large to fit in cache, previous research proposes tiling to reduce or eliminate capacity misses. This paper presents a new algorithm for choosing problem-size dependent tile sizes based on the cache size and cache line size for a direct-mapped cache. The algorithm eliminates both capacity and self-interference misses and reduces cross-interference misses. We measured simulated miss rates and execution times for our algorithm and two others on a variety of problem sizes and cache organizations. At higher set associativity, our algorithm does not always achieve the best performance. However on direct-mapped caches, our algorithm improves simulated miss rates and measured execution times when compared with previous work.This publication has 9 references indexed in Scilit:
- Compiler optimizations for improving data localityPublished by Association for Computing Machinery (ACM) ,1994
- MOB formsPublished by Association for Computing Machinery (ACM) ,1994
- To copy or not to copyPublished by Association for Computing Machinery (ACM) ,1993
- A data locality optimizing algorithmPublished by Association for Computing Machinery (ACM) ,1991
- The cache performance and optimizations of blocked algorithmsPublished by Association for Computing Machinery (ACM) ,1991
- More iteration space tilingPublished by Association for Computing Machinery (ACM) ,1989
- Strategies for cache and local memory management by global program transformationJournal of Parallel and Distributed Computing, 1988
- Supernode partitioningPublished by Association for Computing Machinery (ACM) ,1988
- Cache MemoriesACM Computing Surveys, 1982