Parallel Block Matrix Factorizations on the Shared-Memory Multiprocessor IBM 3090 VF/600J
- 1 April 1992
- journal article
- research article
- Published by SAGE Publications in The International Journal of Supercomputing Applications
- Vol. 6 (1) , 69-97
- https://doi.org/10.1177/109434209200600105
Abstract
Efficient parallel block algorithms for the LU factorization with partial pivoting, the Cholesky factorization, and the QR factorization are presented; the algorithms are transportable over a range of parallel MIMD architectures. Parallel implementations of different block algorithms that use optimized uniprocessor level-3 BLAS are compared with the corresponding routines of LAPACK (under development). In LAPACK, parallelism is mainly invoked implicitly, by replacing calls to uniprocessor level-3 kernels with calls to parallel level-3 kernels, which maintains portability. However, by parallelizing explicitly at the block level, it is possible to overlap and pipeline different matrix-matrix operations and thereby gain some performance. Theoretical models give upper bounds on the best possible speedup of the explicitly and implicitly parallel block algorithms for the target machine.
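To make the distinction between implicit and explicit parallelism more concrete, the following is a minimal sequential sketch of a right-looking blocked Cholesky factorization. It is not the implementation from the article: the function names (`potrf`, `trsm_right_lower_transposed`, `blocked_cholesky`) and the block size `nb` are illustrative assumptions, chosen only to echo the roles of the LAPACK/BLAS kernels of similar name. Under these assumptions, implicit parallelism would correspond to swapping steps 1 to 3 for parallel kernels, while explicit parallelism would schedule the block operations of different iterations so that they overlap.

```python
import math


def potrf(a):
    """Unblocked Cholesky of a small dense block (reads the lower triangle only).

    Returns lower-triangular L with L * L^T == a. Hypothetical stand-in for a
    LAPACK-style diagonal-block factorization.
    """
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for j in range(n):
        d = a[j][j] - sum(l[j][k] ** 2 for k in range(j))
        if d <= 0.0:
            raise ValueError("matrix is not positive definite")
        l[j][j] = math.sqrt(d)
        for i in range(j + 1, n):
            s = a[i][j] - sum(l[i][k] * l[j][k] for k in range(j))
            l[i][j] = s / l[j][j]
    return l


def trsm_right_lower_transposed(b, l):
    """Solve X * L^T = B for X, one row of B at a time (forward substitution)."""
    n = len(l)
    x = []
    for row in b:
        xr = [0.0] * n
        for j in range(n):
            s = row[j] - sum(xr[k] * l[j][k] for k in range(j))
            xr[j] = s / l[j][j]
        x.append(xr)
    return x


def blocked_cholesky(a, nb):
    """Right-looking blocked Cholesky with block size nb (illustrative sketch)."""
    n = len(a)
    w = [row[:] for row in a]  # working copy; only the lower triangle is used
    for k in range(0, n, nb):
        ke = min(k + nb, n)
        # Step 1: factor the diagonal block A11 = L11 * L11^T.
        l11 = potrf([w[i][k:ke] for i in range(k, ke)])
        for i in range(k, ke):
            w[i][k:ke] = l11[i - k]
        if ke == n:
            break
        # Step 2: panel solve L21 * L11^T = A21 (a level-3 triangular solve).
        l21 = trsm_right_lower_transposed(
            [w[i][k:ke] for i in range(ke, n)], l11
        )
        for i in range(ke, n):
            w[i][k:ke] = l21[i - ke]
        # Step 3: trailing update A22 -= L21 * L21^T (lower triangle only).
        for i in range(ke, n):
            for j in range(ke, i + 1):
                w[i][j] -= sum(w[i][p] * w[j][p] for p in range(k, ke))
    # Return L with the strict upper triangle zeroed.
    return [[w[i][j] if j <= i else 0.0 for j in range(n)] for i in range(n)]
```

In this sketch, step 3 carries most of the arithmetic, which is why block algorithms of this shape benefit from fast matrix-matrix kernels; steps 1 and 2 of the next iteration depend only on the first block column of the updated trailing matrix, which is the kind of dependency that block-level scheduling could exploit.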