LU-GPU: Efficient Algorithms for Solving Dense Linear Systems on Graphics Hardware
- 22 December 2005
- proceedings article
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
We present a novel algorithm to solve dense linear systems using graphics processors (GPUs). We reduce matrix decomposition and row operations to a series of rasterization problems on the GPU. These include new techniques for streaming index pairs, swapping rows and columns, and parallelizing the computation to utilize multiple vertex and fragment processors. We also use appropriate data representations to match the rasterization order and cache technology of graphics processors. We have implemented our algorithm on different GPUs and compared the performance with optimized CPU implementations. In particular, our implementation on an NVIDIA GeForce 7800 GPU outperforms a CPU-based ATLAS implementation. Moreover, our results show that our algorithm is cache and bandwidth efficient and scales well with the number of fragment processors within the GPU and the core GPU clock rate. We use our algorithm for fluid flow simulation and demonstrate that the commodity GPU is a useful co-processor for many scientific applications.
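For readers unfamiliar with the decomposition the abstract refers to, the following is a minimal CPU-side sketch of LU factorization with partial pivoting (row swaps only), followed by forward and back substitution. It is not the paper's GPU method: the rasterization mapping, index-pair streaming, and column swaps are not reproduced here, and the function names `lu_decompose` and `lu_solve` are hypothetical, chosen only for illustration.

```python
def lu_decompose(a):
    """Factor a square matrix (list of rows) as P*A = L*U.

    Returns (lu, perm): lu stores U on and above the diagonal and the
    multipliers of L (unit diagonal implied) below it; perm[k] is the
    index of the original row that ended up in position k.
    """
    n = len(a)
    lu = [row[:] for row in a]          # work on a copy
    perm = list(range(n))
    for k in range(n):
        # Partial pivoting: pick the row with the largest entry in column k.
        p = max(range(k, n), key=lambda i: abs(lu[i][k]))
        if lu[p][k] == 0.0:
            raise ValueError("matrix is singular")
        if p != k:
            lu[k], lu[p] = lu[p], lu[k]          # row swap
            perm[k], perm[p] = perm[p], perm[k]
        # Row operations: eliminate column k below the pivot.
        for i in range(k + 1, n):
            lu[i][k] /= lu[k][k]                 # multiplier, stored in L
            m = lu[i][k]
            for j in range(k + 1, n):
                lu[i][j] -= m * lu[k][j]
    return lu, perm


def lu_solve(lu, perm, b):
    """Solve A*x = b using the factors produced by lu_decompose."""
    n = len(lu)
    # Apply the row permutation to b, then forward-substitute with L.
    y = [b[perm[i]] for i in range(n)]
    for i in range(n):
        for j in range(i):
            y[i] -= lu[i][j] * y[j]
    # Back-substitute with U.
    x = y[:]
    for i in reversed(range(n)):
        for j in range(i + 1, n):
            x[i] -= lu[i][j] * x[j]
        x[i] /= lu[i][i]
    return x
```

A call such as `lu_solve(*lu_decompose(A), b)` solves a single system. The inner update over `i` and `j` touches every entry of the trailing submatrix independently, which is the kind of data-parallel work the abstract describes distributing across fragment processors.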