Instruction distribution heuristics for quad-cluster, dynamically-scheduled, superscalar processors

1 December 2000

proceedings article
Published by Association for Computing Machinery (ACM)

p. 337-347
https://doi.org/10.1145/360128.360165

Abstract

We investigate instruction distribution methods for quad- cluster, dynamically-scheduled superscalar processors. We study a variety of methods with different cost, performance and complexity characteristics. We investigate both non-adaptive and adaptive methods and their sensitivity both to inter-cluster communication latencies and pipeline depth. Furthermore, we develop a set of models that allow us to identify how well each method attacks issue-bandwidth and inter-cluster communication restrictions. We find that a relatively simple method that changes clusters every other three instructions offers only a 17% performance slowdown compared to a non- clustered configuration operating at the same frequency. Moreover, we show that by utilizing adaptive methods it is possible to further reduce this gap down to about 14%. Furthermore, performance appears to be more sensitive to inter-cluster communication latencies rather than to pipeline depth. The best performing method offers a slowdown of about 24% when inter-cluster communication latency is two cycle. This gap is only 20% when two additional stages are introduced in the front-end pipeline.

Keywords

This publication has 7 references indexed in Scilit:

Interconnect scaling-the real limiter to high performance ULSI
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Data speculation support for a chip multiprocessor
Published by Association for Computing Machinery (ACM) ,1998
Complexity-effective superscalar processors
Published by Association for Computing Machinery (ACM) ,1997
Dynamic speculation and synchronization of data dependences
Published by Association for Computing Machinery (ACM) ,1997
Will physical scalability sabotage performance gains?
Computer, 1997
Multiscalar processors
Published by Association for Computing Machinery (ACM) ,1995
Instruction issue logic for high-performance, interruptable pipelined processors
Published by Association for Computing Machinery (ACM) ,1987