Programming in VS Fortran on the IBM 3090 for maximum vector performance
- 1 June 1988
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Computer
- Vol. 21 (6) , 65-76
- https://doi.org/10.1109/2.951
Abstract
Programming techniques necessary for high performance on the 3090 Vector Facilities are illustrated, showing that VS Fortran programs can achieve near-maximum execution rates. Relevant features of the 3090 architecture are reviewed, stressing the need to make efficient use of a hierarchical storage system and take advantage of the compound vector instructions. The key programming techniques for managing the storage hierarchy are loop sectioning, loop distribution, and data compaction. Vector register, cache reuse, and virtual memory, storage format, and page reuse are shown to lead to efficient use of the vector registers, the high speed cache, and the virtual memory system, respectively. The multiply-and-add compound instruction is discussed.<>Keywords
This publication has 8 references indexed in Scilit:
- A vectorizing Fortran compilerIBM Journal of Research and Development, 1986
- Squeezing the most out of an algorithm in CRAY FORTRANACM Transactions on Mathematical Software, 1984
- Automatic loop interchangePublished by Association for Computing Machinery (ACM) ,1984
- Implementing Linear Algebra Algorithms for Dense Matrices on a Vector Pipeline MachineSIAM Review, 1984
- On the Performance Enhancement of Paging Systems Through Program Analysis and TransformationsIEEE Transactions on Computers, 1981
- Storage reorganization techniques for matrix computation in a paging environmentCommunications of the ACM, 1979
- On the Paging Performance of Array AlgorithmsIEEE Transactions on Computers, 1977
- Organizing matrices and matrix operations for paged memory systemsCommunications of the ACM, 1969