Instruction distribution heuristics for quad-cluster, dynamically-scheduled, superscalar processors
- 1 December 2000
- proceedings article
- Published by Association for Computing Machinery (ACM)
- p. 337-347
- https://doi.org/10.1145/360128.360165
Abstract
We investigate instruction distribution methods for quad- cluster, dynamically-scheduled superscalar processors. We study a variety of methods with different cost, performance and complexity characteristics. We investigate both non-adaptive and adaptive methods and their sensitivity both to inter-cluster communication latencies and pipeline depth. Furthermore, we develop a set of models that allow us to identify how well each method attacks issue-bandwidth and inter-cluster communication restrictions. We find that a relatively simple method that changes clusters every other three instructions offers only a 17% performance slowdown compared to a non- clustered configuration operating at the same frequency. Moreover, we show that by utilizing adaptive methods it is possible to further reduce this gap down to about 14%. Furthermore, performance appears to be more sensitive to inter-cluster communication latencies rather than to pipeline depth. The best performing method offers a slowdown of about 24% when inter-cluster communication latency is two cycle. This gap is only 20% when two additional stages are introduced in the front-end pipeline.Keywords
This publication has 7 references indexed in Scilit:
- Interconnect scaling-the real limiter to high performance ULSIPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Data speculation support for a chip multiprocessorPublished by Association for Computing Machinery (ACM) ,1998
- Complexity-effective superscalar processorsPublished by Association for Computing Machinery (ACM) ,1997
- Dynamic speculation and synchronization of data dependencesPublished by Association for Computing Machinery (ACM) ,1997
- Will physical scalability sabotage performance gains?Computer, 1997
- Multiscalar processorsPublished by Association for Computing Machinery (ACM) ,1995
- Instruction issue logic for high-performance, interruptable pipelined processorsPublished by Association for Computing Machinery (ACM) ,1987