An Architecture Framework for Transparent Instruction Set Customization in Embedded Processors
- 28 July 2005
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. cs tr 1999 (2) , 272-283
- https://doi.org/10.1109/isca.2005.9
Abstract
Instruction set customization is an effective way to improve processor performance. Critical portions of applicationdata-flow graphs are collapsed for accelerated execution on specialized hardware. Collapsing dataflow subgraphs will compress the latency along critical paths and reduces the number of intermediate results stored in the register file. While custom instructions can be effective, the time and cost of designing a new processor for each application is immense. To overcome this roadblock, this paper proposes a flexible architectural framework to transparently integrate custom instructions into a general-purpose processor. Hardware accelerators are added to the processor to execute the collapsed subgraphs. A simple microarchitectural interface is provided to support a plug-and-play model for integrating a wide range of accelerators into a pre-designed and verified processor core. The accelerators are exploited using an approach of static identification and dynamic realization. The compiler is responsible for identifying profitable subgraphs, while the hardware handles discovery, mapping, and execution of compatible subgraphs. This paper presents the design of a plug-and-play transparent accelerator system and evaluates the cost/performance implications of the design.Keywords
This publication has 22 references indexed in Scilit:
- Application-Specific Processing on a General-Purpose Core via Transparent Instruction Set CustomizationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- From Sequences of Dependent Instructions to FunctionsACM SIGARCH Computer Architecture News, 2004
- Automatic generation of application specific processorsPublished by Association for Computing Machinery (ACM) ,2003
- Automatic application-specific instruction-set extensions under microarchitectural constraintsPublished by Association for Computing Machinery (ACM) ,2003
- SimpleScalar: an infrastructure for computer system modelingComputer, 2002
- rePLay: A hardware framework for dynamic optimizationIEEE Transactions on Computers, 2001
- Code Generation and Optimization Techniques for Embedded Digital Signal ProcessorsPublished by Springer Nature ,1996
- High-performance 3-1 interlock collapsing ALU'sIEEE Transactions on Computers, 1994
- The superblock: An effective technique for VLIW and superscalar compilationThe Journal of Supercomputing, 1993
- Code generation using tree matching and dynamic programmingACM Transactions on Programming Languages and Systems, 1989