Circuits for wide-window superscalar processors
- 1 May 2000
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGARCH Computer Architecture News
- Vol. 28 (2) , 236-247
- https://doi.org/10.1145/342001.339689
Abstract
Our program benchmarks and simulations of novel circuits indicate that large-window processors are feasible. Using our redesigned superscalar components, a large-window processor implemented in today's technology can achieve an increase of 10-60% (geometric mean of 31%) in program speed compared to today's processors. The processor operates at clock speeds comparable to today's processors, but achieves significantly higher ILP. To measure the impact of a large window on clock speed, we design and simulate new implementations of the logic components that most limit the critical path of our large-window processor: the schedule logic and the wake-up logic. We use log-depth cyclic segmented prefix (CSP) circuits to reimplement these components. Our layouts and simulations of critical paths through these circuits indicate that our large-window processor could be clocked at frequencies exceeding 500MHz in today's technology. Our commit logic and rename logic can also run at these speeds. To measure the impact of a large window on ILP, we compare two microarchitectures, the first has a 128-instruction window, an 8-wide fetch unit, and 20-wide issue (four integer, branch, multiply, float, and memory units), whereas the second has a 32-instruction window, and a 4-wide fetch unit and is comparable to today's processors. For each, we simulate different window reuse and bypass policies. Our simulations show that the large-window processor achieves significantly higher IPC. This performance increase comes despite the fact that the large-window processor uses a wrap-around window while the small-window processor uses a compressing window, thus effectively increasing its number of outstanding instructions. Furthermore, the large-window processor sometimes pays an extra clock cycle for bypassing.Keywords
This publication has 7 references indexed in Scilit:
- A comparison of scalable superscalar processorsPublished by Association for Computing Machinery (ACM) ,1999
- The Alpha 21264 microprocessorIEEE Micro, 1999
- Issue logic for a 600-MHz out-of-order execution microprocessorIEEE Journal of Solid-State Circuits, 1998
- One billion transistors, one uniprocessor, one chipComputer, 1997
- Complexity-effective superscalar processorsPublished by Association for Computing Machinery (ACM) ,1997
- Using the SimOS machine simulator to study complex computer systemsACM Transactions on Modeling and Computer Simulation, 1997
- Simultaneous multithreadingACM SIGARCH Computer Architecture News, 1995