Implementing sparse matrix-vector multiplication on throughput-oriented processors
Top Cited Papers
- 14 November 2009
- proceedings article
- Published by Association for Computing Machinery (ACM)
Abstract
No abstract availableThis publication has 11 references indexed in Scilit:
- Concurrent number cruncher: a GPU implementation of a general sparse linear solverInternational Journal of Parallel, Emergent and Distributed Systems, 2009
- Sparse matrix computations on manycore GPU'sPublished by Association for Computing Machinery (ACM) ,2008
- NVIDIA Tesla: A Unified Graphics and Computing ArchitectureIEEE Micro, 2008
- Scalable Parallel Programming with CUDAQueue, 2008
- Solving Dense Linear Systems on Graphics ProcessorsPublished by Springer Nature ,2008
- Optimization of sparse matrix-vector multiplication on emerging multicore platformsPublished by Association for Computing Machinery (ACM) ,2007
- Sparsity: Optimization Framework for Sparse Matrix KernelsThe International Journal of High Performance Computing Applications, 2004
- Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient MethodSIAM Journal on Scientific Computing, 2001
- Implementation of a Portable Nested Data-Parallel LanguageJournal of Parallel and Distributed Computing, 1994
- Basic Linear Algebra Subprograms for Fortran UsageACM Transactions on Mathematical Software, 1979