AnySP
- 15 June 2009
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGARCH Computer Architecture News
- Vol. 37 (3) , 128-139
- https://doi.org/10.1145/1555815.1555773
Abstract
In the past decade, the proliferation of mobile devices has increased at a spectacular rate. There are now more than 3.3 billion active cell phones in the world-a device that we now all depend on in our daily lives. The current generation of devices employs a combination of general-purpose processors, digital signal processors, and hardwired accelerators to provide giga-operations-per-second performance on milliWatt power budgets. Such heterogeneous organizations are inefficient to build and maintain, as well as waste silicon area and power. Looking forward to the next generation of mobile computing, computation requirements will increase by one to three orders of magnitude due to higher data rates, increased complexity algorithms, and greater computation diversity but the power requirements will be just as stringent. Scaling of existing approaches will not suffice instead the inherent computational efficiency, programmability, and adaptability of the hardware must change. To overcome these challenges, this paper proposes an example architecture, referred to as AnySP, for the next generation mobile signal processing. AnySP uses a co-design approach where the next generation wireless signal processing and high-definition video algorithms are analyzed to create a domain specific programmable architecture. At the heart of AnySP is a configurable single-instruction multiple-data datapath that is capable of processing wide vectors or multiple narrow vectors simultaneously. In addition, deeper computation subgraphs can be pipelined across the single-instruction multiple-data lanes. These three operating modes provide high throughput across varying application types. Results show that AnySP is capable of sustaining 4G wireless processing and high-definition video throughput rates, and will approach the 1000 Mops/mW efficiency barrier when scaled to 45nm.Keywords
This publication has 22 references indexed in Scilit:
- Design and Analysis of LDPC Decoders for Software Defined Radio2007 IEEE Workshop on Signal Processing Systems, 2007
- A Customized Cross-Bar for Data-Shuffling in Domain-Specific SIMD ProcessorsPublished by Springer Nature ,2007
- Design and Implementation of Turbo Decoders for Software Defined Radio2007 IEEE Workshop on Signal Processing Systems, 2006
- Bypass aware instruction scheduling for register file power reductionPublished by Association for Computing Machinery (ACM) ,2006
- XiSystem: a XiRisc-based SoC with reconfigurable IO moduleIEEE Journal of Solid-State Circuits, 2005
- Vector Processing as an Enabler for Software-Defined Radio in Handheld DevicesEURASIP Journal on Advances in Signal Processing, 2005
- An Architecture Framework for Transparent Instruction Set Customization in Embedded ProcessorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- ADRES: An Architecture with Tightly Coupled VLIW Processor and Coarse-Grained Reconfigurable MatrixPublished by Springer Nature ,2003
- Implementing streaming SIMD extensions on the Pentium III processorIEEE Micro, 2000
- Intel MMX for multimedia PCsCommunications of the ACM, 1997