Auto-blocking matrix-multiplication or tracking BLAS3 performance from source code
- 21 June 1997
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGPLAN Notices
- Vol. 32 (7) , 206-216
- https://doi.org/10.1145/263767.263789
Abstract
No abstract availableThis publication has 19 references indexed in Scilit:
- Compiler blockability of dense matrix factorizationsACM Transactions on Mathematical Software, 1997
- LogPCommunications of the ACM, 1996
- Data and computation transformations for multiprocessorsACM SIGPLAN Notices, 1995
- A model and compilation strategy for out-of-core data parallel programsACM SIGPLAN Notices, 1995
- Unifying data and control transformations for distributed shared-memory machinesACM SIGPLAN Notices, 1995
- Stability of block algorithms with fast level-3 BLASACM Transactions on Mathematical Software, 1992
- Retire Fortran?Communications of the ACM, 1992
- An extended set of FORTRAN basic linear algebra subprogramsACM Transactions on Mathematical Software, 1988
- Short Notes: Comment on 'The Explicit Quad Tree as a Structure for Computer Graphics'The Computer Journal, 1983
- On the implementation of Strassen's fast multiplication algorithmActa Informatica, 1976