The physical mapping problem for parallel architectures

1 June 1988

journal article
Published by Association for Computing Machinery (ACM) in Journal of the ACM

Vol. 35 (3) , 603-634
https://doi.org/10.1145/44483.44489

Abstract

The problem of realizing an idealized parallel architecture on a (possibly fault-laden) physical architecture is studied. Our formulation performs the mapping in the light of the algorithm that one wants to implement on the idealized architecture. A version of the mapping algorithm suggested by the DIOGENES methodology for designing fault-tolerant VLSI processor arrays is settled definitely. Two quality metrics for mappings are considered, the first embodying an idealized notion of average delay , which relates to power consumption, and the second being the length of the longest run of wire . For the average-delay measure, four algorithms that optimally assign the m vertices of the embedded graph to the n fault-free processors that have been fabricated are presented. The most general algorithm makes no assumptions about the structure of the array or the physical format of the processors; it runs in time O ( m · ( n - m ) ² ). The other algorithms assume that the processors are laid out in such a way that interprocessor distances obey the triangle equality; they run in times ranging from time O (max{ m , n - m } · log min { m, n - m }) for certain array structures, including linear arrays, to time O (max{ m, n - m }) for a narrow class of array structures, including pyramid arrays. For the max-wire-run cost measure, it is shown that the problem of finding cost-optimal vertex-to-processor assignments is NP-complete. However, an algorithm is presented that yields, in time O ( m · ( n - m ) ² ), vertex-to-processor assignments that are within a factor of 3 of optimal (they are optimal when the input graph-embedding is outplanar). This algorithm can easily be converted to one that yields, in time O ( m · ( n - m ) ³ ), vertex-to-processor assignments that are within a factor of 2 of optimal. Finally, an algorithm that yields optimal assignments when the interprocessor distances obey the triangle equality is presented; this algorithm operates in time O ( m · ( n - m ) · log( m · ( n - m )) · log M ), where M is the largest interprocessor distance.

Keywords

This publication has 5 references indexed in Scilit:

A Mapping Strategy for Parallel Processing
IEEE Transactions on Computers, 1987
Wafer-Scale Integration of Systolic Arrays
IEEE Transactions on Computers, 1985
Configuration of VLSI Arrays in the Presence of Defects
Journal of the ACM, 1984
Worst-case Analysis of Set Union Algorithms
Journal of the ACM, 1984
The Diogenes Approach to Testable Fault-Tolerant Arrays of Processors
IEEE Transactions on Computers, 1983