The Queue-Read Queue-Write PRAM Model: Accounting for Contention in Parallel Algorithms
- 1 January 1998
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Computing
- Vol. 28 (2) , 733-769
- https://doi.org/10.1137/s009753979427491
Abstract
This paper introduces the queue-read queue-write ({\sc qrqw}) parallel random access machine ({\sc pram}) model, which permits concurrent reading and writing to shared-memory locations, but at a cost proportional to the number of readers/writers to any one memory location in a given step. Prior to this work there were no formal complexity models that accounted for the contention to memory locations, despite its large impact on the performance of parallel programs. The {\sc qrqw pram} model reflects the contention properties of most commercially available parallel machines more accurately than either the well-studied {\sc crcw pram} or {\sc erew pram} models: the {\sc crcw} model does not adequately penalize algorithms with high contention to shared-memory locations, while the {\sc erew} model is too strict in its insistence on zero contention at each step.The {\sc qrqw pram} is strictly more powerful than the {\sc erew pram}. This paper shows a separation of $\sqrt{\log n}$ between the two models, and presents faster and more efficient {\sc qrqw} algorithms for several basic problems, such as linear compaction, leader election, and processor allocation. Furthermore, we present a work-preserving emulation of the {\sc qrqw pram} with only logarithmic slowdown on Valiant's {\sc bsp} model, and hence on hypercube-type noncombining networks, even when latency, synchronization, and memory granularity overheads are taken into account. This matches the best-known emulation result for the {\sc erew pram}, and considerably improves upon the best-known efficient emulation for the {\sc crcw pram} on such networks. Finally, the paper presents several lower bound results for this model, including lower bounds on the time required for broadcasting and for leader election.
Keywords
This publication has 19 references indexed in Scilit:
- ERCW PRAMs and optical communicationTheoretical Computer Science, 1998
- Contention in shared memory algorithmsJournal of the ACM, 1997
- Efficient Low-Contention Parallel AlgorithmsJournal of Computer and System Sciences, 1996
- Retrieval of scattered information by EREW, CREW, and CRCW PRAMscomputational complexity, 1995
- Exact lower time bounds for computing Boolean functions on CREW PRAMsJournal of Computer and System Sciences, 1994
- A bridging model for parallel computationCommunications of the ACM, 1990
- Parallel Prefix ComputationJournal of the ACM, 1980
- The Parallel Evaluation of General Arithmetic ExpressionsJournal of the ACM, 1974
- Probability Inequalities for Sums of Bounded Random VariablesJournal of the American Statistical Association, 1963
- Probability Inequalities for Sums of Bounded Random VariablesJournal of the American Statistical Association, 1963