Improving Multiprocessor Performance with Coarse-Grain Coherence Tracking
- 28 July 2005
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 33 (2) , 246-257
- https://doi.org/10.1109/isca.2005.31
Abstract
To main coherence in conventional shared-memory multiprocessor systems, processors first check other proessors' caches before obtaining data from memory. This coherence checking adds latency to memory requests and leads to large amounts of interconnect traffic in broadcast-based systems. Our results for a set of commercial, scientific and multiprogrammed workloads show that on average 67% (and up to 94%) of broadcasts are unnecessary. Coarse-Grain Coherence Tracking is a new technique that supplements a conventional coherence mechanism and optimizes the performance of coherence enforcement. The Coarse-Grain Coherence mechanism monitors the coherence status of large regions of memory, and uses that information to avoid unnecessary broadcasts. Coarse-Grain Coherence Tracking is shown to eliminate 55-97% of the unnecessary broadcasts, and improve performance by 8.8% on average (and up to 21.7%).Keywords
This publication has 18 references indexed in Scilit:
- RegionScout: Exploiting Coarse Grain Sharing in Snoop-Based CoherencePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Simulating a $2M commercial server on a $2K PCComputer, 2003
- Using destination-set prediction to improve the latency/bandwidth tradeoff in shared-memory multiprocessorsPublished by Association for Computing Machinery (ACM) ,2003
- Token coherencePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Dynamic self-invalidation: reducing coherence overhead in shared-memory multiprocessorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- JETTY: filtering snoops for reduced energy consumption in SMP serversPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- TLB and snoop energy-reduction using virtual caches in low-power chip-multiprocessorsPublished by Association for Computing Machinery (ACM) ,2002
- Timestamp snoopingPublished by Association for Computing Machinery (ACM) ,2000
- Isotach networksIEEE Transactions on Parallel and Distributed Systems, 1997
- Adjustable block size coherent cachesPublished by Association for Computing Machinery (ACM) ,1992