Compiler transformations for high-performance computing
- 1 December 1994
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Computing Surveys
- Vol. 26 (4) , 345-420
- https://doi.org/10.1145/197405.197406
Abstract
In the last three decades a large number of compiler transformations for optimizing programs have been implemented. Most optimizations for uniprocessors reduce the number of instructions executed by the program using transformations based on the analysis of scalar quantities and data-flow techniques. In contrast, optimizations for high-performance superscalar, vector, and parallel processors maximize parallelism and memory locality with transformations that rely on tracking the properties of arrays using loop dependence analysis. This survey is a comprehensive overview of the important high-level program restructuring techniques for imperative languages, such as C and Fortran. Transformations for both sequential and various types of parallel architectures are covered in depth. We describe the purpose of each transformation, explain how to determine if it is legal, and give an example of its application. Programmers wishing to enhance the performance of their code can use this survey to improve their understanding of the optimizations that compilers can perform, or as a reference for techniques to be applied manually. Students can obtain an overview of optimizing compiler technology. Compiler writers can use this survey as a reference for most of the important optimizations developed to date, and as bibliographic reference for the details of each optimization. Readers are expected to be familiar with modern computer architecture and basic program compilation techniques.Keywords
This publication has 104 references indexed in Scilit:
- Global optimizations for parallelism and locality on scalable parallel machinesACM SIGPLAN Notices, 1993
- Architecture of the Pentium microprocessorIEEE Micro, 1993
- Compilation of Haskell array comprehensions for scientific computingACM SIGPLAN Notices, 1990
- An overview of the PTRAN analysis system for multiprocessingJournal of Parallel and Distributed Computing, 1988
- Optimal loop parallelizationPublished by Association for Computing Machinery (ACM) ,1988
- Automatic translation of FORTRAN programs to vector formACM Transactions on Programming Languages and Systems, 1987
- Conversion of control dependence to data dependencePublished by Association for Computing Machinery (ACM) ,1983
- On the Performance Enhancement of Paging Systems Through Program Analysis and TransformationsIEEE Transactions on Computers, 1981
- Code Generation for Expressions with Common SubexpressionsJournal of the ACM, 1977
- An Algorithm for Translating Boolean ExpressionsJournal of the ACM, 1962