Universal mechanisms for data-parallel architectures
- 6 May 2004
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Data-parallel programs are both growing in importanceand increasing in diversity, resulting in specialized processorstargeted at specific classes of these programs. This paperpresents a classification scheme for data-parallelprogram attributes, and proposes micro-architecturalmechanisms to support applications with diverse behaviorusing a single reconfigurable architecture. We focuson the following four broad kinds of data-parallel programs- DSP/multimedia, scientific, networking, andreal-time graphics workloads. While all of these programsexhibit high computational intensity, coarse-grainregular control behavior, and some regular memory accessbehavior, they show wide variance in the computationrequirements, fine grain control behavior, and the frequencyof other types of memory accesses. Based onthis study of application attributes, this paper proposesa set of general micro-architectural mechanismsthat enable a baseline architecture to be dynamically tailoredto the demands of a particular application. Thesemechanisms provide efficient execution across a spectrumof data-parallel applications and can be applied todiverse architectures ranging from vector cores to conventionalsuperscalar cores. Our results using a baselineTRIPS processor show that the configurability of the architectureto the application demands provides harmonicmean performance improvement of 5%-55% over scalableyet less flexible architectures, and performs competitivelyagainst specialized architectures.Keywords
This publication has 15 references indexed in Scilit:
- CgPublished by Association for Computing Machinery (ACM) ,2003
- Custom-fit processors: letting applications define architecturesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A bandwidth-efficient architecture for media processingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Efficient conditional operations for data-parallel architecturesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The Raw microprocessor: a computational fabric for software circuits and general-purpose programsIEEE Micro, 2002
- Vector unit architecture for emotion synthesisIEEE Micro, 2000
- Out-of-order vector architecturesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1997
- Cheops: a reconfigurable data-flow system for video processingIEEE Transactions on Circuits and Systems for Video Technology, 1995
- Reality Engine graphicsPublished by Association for Computing Machinery (ACM) ,1993
- The CRAY-1 computer systemCommunications of the ACM, 1978