Blocked algorithms and software for reduction of a regular matrix pair to generalized Schur form
- 1 December 1999
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Mathematical Software
- Vol. 25 (4) , 425-454
- https://doi.org/10.1145/332242.332244
Abstract
A two-stage blocked algorithm for reduction of a regular matrix pair ( A , B ) to upper Hessenberg-triangular form is presented. In stage 1 ( A, B is reduced to block upper Hessenberg-triangular form using mainly level 3 (matrix-matrix) operations that permit data reuse in the higher levels of a memory hierarchy. In the second stage all but one of the r subdiagonals of the block Hessenberg A -part are set to zero using Givens rotations. The algorithm proceeds in a sequence of supersweeps, each reducing m columns. The updates with respect to row and column rotations are organized to reference consecutive columns of A and B . To further improve the data locality, all rotations produced in a supersweep are stored to enable a left-looking reference pattern, i.e., all updates are delayed until they are required for the continuation of the supersweep. Moreover, we present a blocked variant of the single-diagonal double-shift QZ method for computing the generalized Schur form of ( A, B in upper Hessenberg-triangular form. The blocking for improved data locality is done similarly, now by restructuring the reference pattern of the updates associated with the bulge chasing in the QZ iteration. Timing results show that our new blocked variants outperform the current LAPACK routines, including drivers for the generalized eigenvalue problem, by a factor 2-5 for sufficiently large problems.Keywords
This publication has 16 references indexed in Scilit:
- GEMM-based level 3 BLASACM Transactions on Mathematical Software, 1998
- Algorithm 784: GEMM-based level 3 BLASACM Transactions on Mathematical Software, 1998
- ScaLAPACK Users' GuidePublished by Society for Industrial & Applied Mathematics (SIAM) ,1997
- A parallel algorithm for the reduction of a nonsymmetric matrix to block upper-Hessenberg formParallel Computing, 1995
- A multishift QR iteration without computation of the shiftsNumerical Algorithms, 1994
- A Parallel Algorithm for Reducing Symmetric Banded Matrices to Tridiagonal FormSIAM Journal on Scientific Computing, 1993
- Algorithm 679: A set of level 3 basic linear algebra subprograms: model implementation and test programsACM Transactions on Mathematical Software, 1990
- ON A BLOCK IMPLEMENTATION OF HESSENBERG MULTISHIFT QR ITERATIONInternational Journal of High Speed Computing, 1989
- The WY Representation for Products of Householder MatricesSIAM Journal on Scientific and Statistical Computing, 1987
- A note on the efficient solution of matrix pencil systemsBIT Numerical Mathematics, 1978