Evaluation of mechanisms for fine-grained parallel programs in the J-machine and the CM-5
- 1 May 1993
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGARCH Computer Architecture News
- Vol. 21 (2) , 302-313
- https://doi.org/10.1145/173682.165165
Abstract
This paper uses an abstract machine approach to compare the mechanisms of two parallel machines: the J-Machine and the CM-5. High-level parallel programs are translated by a single optimizing compiler to a fine-grained abstract parallel machine, TAM. A final compilation step is unique to each machine and optimizes for specifics of the architecture. By determining the cost of the primitives and weighting them by their dynamic frequency in parallel programs, we quantify the effectiveness of the following mechanisms individually and in combination. Efficient processor/network coupling proves valuable. Message dispatch is found to be less valuable without atomic operations that allow the scheduling levels to cooperate. Multiple hardware contexts are of small value when the contexts cooperate and the compiler can partition the register set. Tagged memory provides little gain. Finally, the performance of the overall system is strongly influenced by the performance of the memory system and the frequency of control operations.
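The weighting step described in the abstract (per-primitive cost multiplied by dynamic frequency) can be illustrated with a minimal sketch. The primitive names, cycle counts, and frequencies below are made-up placeholders, not figures from the paper, and the function `weighted_cost` is a hypothetical helper rather than the authors' tooling.

```python
def weighted_cost(cycles, frequency):
    """Sum of (cost per primitive) * (dynamic execution count).

    cycles:    primitive name -> cost in cycles on one machine
    frequency: primitive name -> number of times the primitive executes
    """
    missing = set(frequency) - set(cycles)
    if missing:
        raise KeyError(f"no cost given for primitives: {sorted(missing)}")
    return sum(cycles[name] * count for name, count in frequency.items())


# Placeholder numbers for two hypothetical machines (NOT measured data).
cycles_a = {"send": 10, "dispatch": 4, "switch": 6}
cycles_b = {"send": 30, "dispatch": 12, "switch": 8}

# Placeholder dynamic frequencies, shared by both machines because the
# same abstract-machine program runs on each.
frequency = {"send": 100, "dispatch": 100, "switch": 50}

total_a = weighted_cost(cycles_a, frequency)
total_b = weighted_cost(cycles_b, frequency)
print(total_a, total_b)
```

Holding the frequencies fixed and varying only the per-primitive costs is what lets a comparison of this kind attribute a difference in total cycles to a specific mechanism.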
This publication has 4 references indexed in Scilit:
- Fine-grain parallelism with minimal hardware support: a compiler-controlled threaded abstract machine. Published by Association for Computing Machinery (ACM), 1991
- The NAS parallel benchmarks---summary and preliminary results. Published by Association for Computing Machinery (ACM), 1991
- Vectorization on Monte Carlo particle transport: an architectural study using the LANL benchmark “GAMTEB”. Published by Association for Computing Machinery (ACM), 1989
- Future scientific programming on parallel machines. Journal of Parallel and Distributed Computing, 1988