Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software
- 1 January 2004
- journal article
- review article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Review
- Vol. 46 (1) , 3-45
- https://doi.org/10.1137/s0036144503428693
Abstract
Matrix computations are both fundamental and ubiquitous in computational science and its vast application areas. Along with the development of more advanced computer systems with complex memory hierarchies, there is a continuing demand for new algorithms and library software that efficiently utilize and adapt to new architecture features. This article reviews and details some of the recent advances made by applying the paradigm of recursion to dense matrix computations on today's memory-tiered computer systems. Recursion allows for efficient utilization of a memory hierarchy and generalizes existing fixed blocking by introducing automatic variable blocking that has the potential of matching every level of a deep memory hierarchy. Novel recursive blocked algorithms offer new ways to compute factorizations such as Cholesky and QR and to solve matrix equations. In fact, the whole gamut of existing dense linear algebra factorization is beginning to be reexamined in view of the recursive paradigm. Use of recursion has led to using new hybrid data structures and optimized superscalar kernels. The results we survey include new algorithms and library software implementations for level 3 kernels, matrix factorizations, and the solution of general systems of linear equations and several common matrix equations. The software implementations we survey are robust and show impressive performance on today's high performance computing systems.Keywords
This publication has 62 references indexed in Scilit:
- Recursive blocked algorithms for solving triangular systems—Part IIACM Transactions on Mathematical Software, 2002
- Recursive blocked algorithms for solving triangular systems—Part IACM Transactions on Mathematical Software, 2002
- Algorithm 784: GEMM-based level 3 BLASACM Transactions on Mathematical Software, 1998
- Compiler blockability of dense matrix factorizationsACM Transactions on Mathematical Software, 1997
- LAPACK-style algorithms and software for solving the generalized Sylvester equation and estimating the separation between regular matrix pairsACM Transactions on Mathematical Software, 1996
- On computing condition numbers for the nonsymmetric eigenproblemACM Transactions on Mathematical Software, 1993
- Perturbation theory and backward error forAX−XB=CBIT Numerical Mathematics, 1993
- Numerical Solution of the Stable, Non-negative Definite Lyapunov Equation Lyapunov EquationIMA Journal of Numerical Analysis, 1982
- A divide and conquer method for the symmetric tridiagonal eigenproblemNumerische Mathematik, 1980
- A Hessenberg-Schur method for the problem AX + XB= CIEEE Transactions on Automatic Control, 1979