Use of Level 3 Blas in Lu Factorization in a Multiprocessing Environment On Three Vector Multiprocessors: the Alliant Fx/80, the Cray-2, and the Ibm 3090 Vf

1 September 1991

journal article
other
Published by SAGE Publications in The International Journal of Supercomputing Applications

Vol. 5 (3) , 92-110
https://doi.org/10.1177/109434209100500308

Abstract

We study various implementations of block Gaussian elimination on full matrices and examine their perfor mance on three parallel computers, the Alliant FX/80, the CRAY-2, and the IBM 3090-400/VF. These imple mentations are expressed in terms of Level 3 BLAS matrix-matrix kernels. We consider the use of parallel Level 3 BLAS kernels and compare the parallelism ob tained within the computational kernels with that ob tained when parallelizing over the kernels. We show that the use of parallel Level 3 BLAS allows portability without sacrifice of efficiency, even in a parallel envi ronment, and that high speeds can be obtained if tuned versions of the kernels are available.

Keywords

This publication has 13 references indexed in Scilit:

Parallel Algorithms for Dense Linear Algebra Computations
SIAM Review, 1990
A set of level 3 basic linear algebra subprograms
ACM Transactions on Mathematical Software, 1990
Algorithm 679: A set of level 3 basic linear algebra subprograms: model implementation and test programs
ACM Transactions on Mathematical Software, 1990
Vectorization of a Multiprocessor Multifrontal Code
The International Journal of Supercomputing Applications, 1989
Level 3 Blas in Lu Factorization On the Cray-2, Eta-10P, and Ibm 3090-200/Vf
The International Journal of Supercomputing Applications, 1989
An extended set of FORTRAN basic linear algebra subprograms
ACM Transactions on Mathematical Software, 1988
Impact of Hierarchical Memory Systems On Linear Algebra Algorithm Design
The International Journal of Supercomputing Applications, 1988
The Use of BLAS3 in Linear Algebra on a Parallel Processor with a Hierarchical Memory
SIAM Journal on Scientific and Statistical Computing, 1987
The WY Representation for Products of Householder Matrices
SIAM Journal on Scientific and Statistical Computing, 1987
Implementing Linear Algebra Algorithms for Dense Matrices on a Vector Pipeline Machine
SIAM Review, 1984