Memory-centric video processing
- 15 April 2008
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Circuits and Systems for Video Technology
- Vol. 18 (4) , 439-452
- https://doi.org/10.1109/tcsvt.2008.918775
Abstract
This work presents a domain-specific memory subsystem based on a two-level memory hierarchy. It targets the application domain of video post-processing applications including video enhancement and format conversion. These applications are based on motion compensation and/or broad class of content adaptive filtering to provide the highest quality of pictures. Our approach meets the required performance and has sufficient flexibility for the application domain. It especially aims at the implementation-wise most challenging applications: compute-intensive and bandwidth-demanding applications that provide the highest quality at high picture resolutions. The lowest level of the memory hierarchy, closest to the processing element, the L0 scratchpad, is organized specifically to enable fast retrieval of an arbitrarily positioned 2-D block of pixels to the processing element. To guarantee the performance, most of its addressing logic is hardwired, leaving a user a set of API for initialization and storing/loading the data to/from the L0 scratchpad. The next level of the memory hierarchy, the L1 scratchpad, minimizes the off-chip memory bandwidth requirements. The L1 scratchpad is organized specifically to enable efficient aligned block-based accesses. With lower data rates compared to the L0 scratchpad and aligned block access, software-based addressing is used to enable full flexibility. The two-level memory hierarchy exploits prefetching to further improve the performance.Keywords
This publication has 17 references indexed in Scilit:
- An Efficient Picture-Rate Up-ConverterJournal of Signal Processing Systems, 2005
- On Design of Parallel Memory Access Schemes for Video CodingJournal of Signal Processing Systems, 2005
- Application specific instruction-set processor template for motion estimation in video applicationsIEEE Transactions on Circuits and Systems for Video Technology, 2005
- Tackling occlusion in scan rate conversion systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A programmable co-porcessor for MPEG-4 videoPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- An efficient buffer memory system for subarray accessIEEE Transactions on Parallel and Distributed Systems, 2001
- A 1.3-GOPS parallel DSP for high-performance image-processing applicationsIEEE Journal of Solid-State Circuits, 2000
- System on silicon-IC for motion compensated scan rate conversion picture-in-picture processing, split screen applications and display processingIEEE Transactions on Consumer Electronics, 1999
- Access and Alignment of Data in an Array ProcessorIEEE Transactions on Computers, 1975
- The Organization and Use of Parallel MemoriesIEEE Transactions on Computers, 1971