Parallel Block Matrix Factorizations on the Shared-Memory Multiprocessor Ibm 3090 VF/600J

1 April 1992

journal article
research article
Published by SAGE Publications in The International Journal of Supercomputing Applications

Vol. 6 (1) , 69-97
https://doi.org/10.1177/109434209200600105

Abstract

Efficient parallel block algorithms for the LU factorization with partial pivoting, the Cholesky factorization, and the QR factorization transportable over a range of parallel MIMD architectures are presented. Parallel implementations of different block algorithms that utilize optimized uniprocessor level-3 BLAS are compared with corresponding routines of LAPACK (under development). Parallelism is mainly invoked implicitly in LAPACK by replacing calls to uniprocessor level-3 kernels by calls to parallel level-3 kernels and thereby maintaining portability. However, by parallelizing at the block level (explicitly) it is possible to overlap and pipeline different matrix-matrix operations and thereby gain some performance. Theoretical models give upper bounds on the best possible speedup of the explicitly and implicitly parallel block algorithms for the target machine.

Keywords

This publication has 13 references indexed in Scilit:

Parallel Algorithms for Dense Linear Algebra Computations
SIAM Review, 1990
A set of level 3 basic linear algebra subprograms
ACM Transactions on Mathematical Software, 1990
Adaptive blocking in the QR factorization
The Journal of Supercomputing, 1989
A Storage-Efficient $WY$ Representation for Products of Householder Transformations
SIAM Journal on Scientific and Statistical Computing, 1989
Storage hierarchies
IBM Systems Journal, 1989
Programming in VS Fortran on the IBM 3090 for maximum vector performance
Computer, 1988
An extended set of FORTRAN basic linear algebra subprograms
ACM Transactions on Mathematical Software, 1988
Impact of Hierarchical Memory Systems On Linear Algebra Algorithm Design
The International Journal of Supercomputing Applications, 1988
The WY Representation for Products of Householder Matrices
SIAM Journal on Scientific and Statistical Computing, 1987
Basic Linear Algebra Subprograms for Fortran Usage
ACM Transactions on Mathematical Software, 1979