Using interaction costs for microarchitectural bottleneck analysis
- 25 May 2004
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Attacking bottlenecks in modern processors is difficult because many microarchitectural events overlap with each other. This parallelism makes it difficult to both: (a) assign a cost to an event (e.g., to one of two overlapping cache misses); and (b) assign blame for each cycle (e.g., for a cycle where many, overlapping resources are active). This paper introduces a new model for understanding event costs to facilitate processor design and optimization. First, we observe that everything in a machine (instructions, hardware structures, events) can interact in only one of two ways (in parallel or serially). We quantify these interactions by defining interaction cost, which can be zero (independent, no interaction), positive (parallel), or negative (serial). Second, we illustrate the value of using interaction costs in processor design and optimization. Finally, we propose performance-monitoring hardware for measuring interaction costs that is suitable for modern processors.Keywords
This publication has 25 references indexed in Scilit:
- Loose loops sink chipsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Load latency tolerance in dynamically scheduled processorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Improving trace cache effectiveness with branch promotion and trace packingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Joint local and global hardware adaptations for energyPublished by Association for Computing Machinery (ACM) ,2002
- Focusing processor policies via critical-path predictionPublished by Association for Computing Machinery (ACM) ,2001
- Performance of database workloads on shared-memory systems with out-of-order processorsPublished by Association for Computing Machinery (ACM) ,1998
- Continuous profilingACM Transactions on Computer Systems, 1997
- The SimpleScalar tool set, version 2.0ACM SIGARCH Computer Architecture News, 1997
- Performance analysis using the MIPS R10000 performance countersPublished by Association for Computing Machinery (ACM) ,1996
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995