A flexible parallel architecture adapted to block-matching motion-estimation algorithms
- 1 January 1996
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Circuits and Systems for Video Technology
- Vol. 6 (1), 74-86
- https://doi.org/10.1109/76.486422
Abstract
This paper describes a novel architecture that offers the flexibility of implementing widely varying motion-estimation algorithms. To achieve real-time performance, we employ multiple processing elements (PEs) which communicate with multiple memory banks via a multistage interconnection network. Three different block-matching algorithms (full search, three-step search, and conjugate-direction search) have been mapped onto this architecture to illustrate its programmability. We schedule the desired operations and design the required data-flow in such a way that processor utilization is high and memory bandwidth is at a feasible level. The details regarding the flow of the pixel data and the scheduling and allocation of the desired ALU operations (which pixels are processed on which processors in which clock cycles) are described in the paper. We analyze the performance of the proposed architecture for several different interconnection networks and data-memory organizations.
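For readers unfamiliar with the algorithms named in the abstract, the following is a minimal sequential sketch of two of them, full search and three-step search, using the sum of absolute differences (SAD) as the matching criterion. It is not the paper's parallel mapping onto PEs and memory banks. The SAD criterion, the function names, the frame layout (lists of rows), and the `ramp_frames` test helper are all assumptions made for illustration. Conjugate-direction search is omitted.

```python
def sad(cur, ref, bx, by, dx, dy, n):
    """SAD between the n x n block at (bx, by) in cur and the block
    displaced by (dx, dy) in ref. Frames are lists of pixel rows."""
    total = 0
    for y in range(n):
        for x in range(n):
            total += abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
    return total


def in_bounds(ref, bx, by, dx, dy, n):
    """True if the displaced block lies entirely inside the reference frame."""
    h, w = len(ref), len(ref[0])
    return 0 <= bx + dx and bx + dx + n <= w and 0 <= by + dy and by + dy + n <= h


def full_search(cur, ref, bx, by, n, p):
    """Test every displacement in [-p, p] x [-p, p]; return (cost, dx, dy)."""
    best = None
    for dy in range(-p, p + 1):
        for dx in range(-p, p + 1):
            if in_bounds(ref, bx, by, dx, dy, n):
                cost = sad(cur, ref, bx, by, dx, dy, n)
                if best is None or cost < best[0]:
                    best = (cost, dx, dy)
    return best


def three_step_search(cur, ref, bx, by, n, step=4):
    """Test the centre and its 8 neighbours at the current step size,
    recentre on the best candidate, halve the step, and repeat.
    With step=4 the steps are 4, 2, 1, covering displacements up to +/-7."""
    cx, cy = 0, 0
    best_cost = sad(cur, ref, bx, by, 0, 0, n)
    while step >= 1:
        best_dx, best_dy = cx, cy
        for dy in (-step, 0, step):
            for dx in (-step, 0, step):
                cand_dx, cand_dy = cx + dx, cy + dy
                if in_bounds(ref, bx, by, cand_dx, cand_dy, n):
                    cost = sad(cur, ref, bx, by, cand_dx, cand_dy, n)
                    if cost < best_cost:
                        best_cost, best_dx, best_dy = cost, cand_dx, cand_dy
        cx, cy = best_dx, best_dy
        step //= 2
    return (best_cost, cx, cy)


def ramp_frames(size, dx0, dy0):
    """Hypothetical test data: a reference frame with unique pixel values and
    a current frame whose content matches ref displaced by (dx0, dy0)."""
    ref = [[x + 100 * y for x in range(size)] for y in range(size)]

    def clamp(v):
        return min(max(v, 0), size - 1)

    cur = [[ref[clamp(y + dy0)][clamp(x + dx0)] for x in range(size)]
           for y in range(size)]
    return cur, ref


if __name__ == "__main__":
    cur, ref = ramp_frames(32, 3, -2)
    print("full search:      ", full_search(cur, ref, 12, 12, 8, 7))
    print("three-step search:", three_step_search(cur, ref, 12, 12, 8))
```

Full search evaluates all (2p + 1)^2 candidate displacements, while three-step search evaluates at most 25, so the two differ greatly in operation count and data-access pattern. Three-step search is not guaranteed to reach the minimum that full search finds, since it only follows the best candidate at each step size.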