Understanding throughput-oriented architectures

1 November 2010

journal article
research article
Published by Association for Computing Machinery (ACM) in Communications of the ACM

Vol. 53 (11), 58-66
https://doi.org/10.1145/1839676.1839694

Abstract

For workloads with abundant parallelism, GPUs deliver higher peak computational throughput than latency-oriented CPUs.
