Fast barrier synchronization hardware
- 4 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
Many recent studies have considered the importance of barrier synchronization overhead on parallel loop performance, especially for large-scale parallel machines. This paper describes a hardware scheme for supporting fast barrier synchronization. It allows barrier synchronization to be performed within a single instruction cycle for moderately sized systems, and is scalable with logarithmic increase in synchronization time. It supports a large number of concurrent barriers, and can also be used to support a number of different barrier synchronization schemes. Simulation results show that under reasonable assumptions, this hardware can decrease parallel loop execution time significantly, especially for statically scheduled loops.Keywords
This publication has 14 references indexed in Scilit:
- Information integration and synchronization in distributed sensor networksIEEE Transactions on Systems, Man, and Cybernetics, 1991
- The fuzzy barrier: a mechanism for high speed synchronization of processorsPublished by Association for Computing Machinery (ACM) ,1989
- A fetch-and-op implementation for parallel computersACM SIGARCH Computer Architecture News, 1988
- Two algorithms for barrier synchronizationInternational Journal of Parallel Programming, 1988
- Comparing Barrier AlgorithmsPublished by Defense Technical Information Center (DTIC) ,1987
- Distributing Hot-Spot Addressing in Large-Scale MultiprocessorsIEEE Transactions on Computers, 1987
- Effects of synchronization barriers on multiprocessor performanceParallel Computing, 1986
- A decentralized control, highly concurrent multiprocesssorACM SIGARCH Computer Architecture News, 1985
- The NYU Ultracomputer—Designing an MIMD Shared Memory Parallel ComputerIEEE Transactions on Computers, 1983
- Performance of Synchronized Iterative Processes in Multiprocessor SystemsIEEE Transactions on Software Engineering, 1982