Minimizing development and maintenance costs in supporting persistently optimized BLAS
- 12 January 2005
- journal article
- research article
- Published by Wiley in Software: Practice and Experience
- Vol. 35 (2) , 101-121
- https://doi.org/10.1002/spe.626
Abstract
No abstract availableKeywords
This publication has 13 references indexed in Scilit:
- Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library SoftwareSIAM Review, 2004
- Automated empirical optimizations of software and the ATLAS projectParallel Computing, 2001
- GEMM-based level 3 BLASACM Transactions on Mathematical Software, 1998
- Locality of Reference in LU Decomposition with Partial PivotingSIAM Journal on Matrix Analysis and Applications, 1997
- Compiler transformations for high-performance computingACM Computing Surveys, 1994
- A parallel block implementation of Level-3 BLAS for MIMD vector processorsACM Transactions on Mathematical Software, 1994
- A set of level 3 basic linear algebra subprogramsACM Transactions on Mathematical Software, 1990
- An extended set of FORTRAN basic linear algebra subprogramsACM Transactions on Mathematical Software, 1988
- Algorithm 656: an extended set of basic linear algebra subprograms: model implementation and test programsACM Transactions on Mathematical Software, 1988
- Basic Linear Algebra Subprograms for Fortran UsageACM Transactions on Mathematical Software, 1979