Toward kilo-instruction processors
Open Access
- 1 December 2004
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Architecture and Code Optimization
- Vol. 1 (4) , 389-417
- https://doi.org/10.1145/1044823.1044825
Abstract
The continuously increasing gap between processor and memory speeds is a serious limitation to the performance achievable by future microprocessors. Currently, processors tolerate long-latency memory operations largely by maintaining a high number of in-flight instructions. In the future, this may require supporting many hundreds, or even thousands, of in-flight instructions. Unfortunately, the traditional approach of scaling up critical processor structures to provide such support is impractical at these levels, due to area, power, and cycle time constraints.In this paper we show that, in order to overcome this resource-scalability problem, the way in which critical processor resources are managed must be changed. Instead of simply upsizing the processor structures, we propose a smarter use of the available resources, supported by a selective checkpointing mechanism. This mechanism allows instructions to commit out of order, and makes a reorder buffer unnecessary. We present a set of techniques such as multilevel instruction queues, late allocation and early release of registers, and early release of load/store queue entries. All together, these techniques constitute what we call a kilo-instruction processor , an architecture that can support thousands of in-flight instructions, and thus may achieve high performance even in the presence of large memory access latencies.Keywords
This publication has 8 references indexed in Scilit:
- Out-of-Order Commit ProcessorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Runahead execution: an effective alternative to large instruction windowsIEEE Micro, 2003
- Checkpoint processing and recovery: an efficient, scalable alternative to reorder buffersIEEE Micro, 2003
- A case for resource-conscious out-of-order processorsPublished by Association for Computing Machinery (ACM) ,2003
- DTM, DVS, optimal controlIEEE Computer Architecture Letters, 2003
- POWER4 system microarchitectureIBM Journal of Research and Development, 2002
- Cache MemoriesACM Computing Surveys, 1982
- Space/time trade-offs in hash coding with allowable errorsCommunications of the ACM, 1970