Implementing sparse matrix-vector multiplication on throughput-oriented processors

14 November 2009

Proceedings article
Published by Association for Computing Machinery (ACM)

This publication has 11 references indexed in Scilit:

Concurrent number cruncher: a GPU implementation of a general sparse linear solver
International Journal of Parallel, Emergent and Distributed Systems, 2009

Sparse matrix computations on manycore GPU's
Published by Association for Computing Machinery (ACM), 2008

NVIDIA Tesla: A Unified Graphics and Computing Architecture
IEEE Micro, 2008

Scalable Parallel Programming with CUDA
Queue, 2008

Solving Dense Linear Systems on Graphics Processors
Published by Springer Nature, 2008

Optimization of sparse matrix-vector multiplication on emerging multicore platforms
Published by Association for Computing Machinery (ACM), 2007

Sparsity: Optimization Framework for Sparse Matrix Kernels
The International Journal of High Performance Computing Applications, 2004

Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient Method
SIAM Journal on Scientific Computing, 2001

Implementation of a Portable Nested Data-Parallel Language
Journal of Parallel and Distributed Computing, 1994

Basic Linear Algebra Subprograms for Fortran Usage
ACM Transactions on Mathematical Software, 1979