Memory-centric video processing

Abstract

This work presents a domain-specific memory subsystem based on a two-level memory hierarchy. It targets the application domain of video post-processing applications including video enhancement and format conversion. These applications are based on motion compensation and/or broad class of content adaptive filtering to provide the highest quality of pictures. Our approach meets the required performance and has sufficient flexibility for the application domain. It especially aims at the implementation-wise most challenging applications: compute-intensive and bandwidth-demanding applications that provide the highest quality at high picture resolutions. The lowest level of the memory hierarchy, closest to the processing element, the L0 scratchpad, is organized specifically to enable fast retrieval of an arbitrarily positioned 2-D block of pixels to the processing element. To guarantee the performance, most of its addressing logic is hardwired, leaving a user a set of API for initialization and storing/loading the data to/from the L0 scratchpad. The next level of the memory hierarchy, the L1 scratchpad, minimizes the off-chip memory bandwidth requirements. The L1 scratchpad is organized specifically to enable efficient aligned block-based accesses. With lower data rates compared to the L0 scratchpad and aligned block access, software-based addressing is used to enable full flexibility. The two-level memory hierarchy exploits prefetching to further improve the performance.

Keywords

This publication has 17 references indexed in Scilit:

An Efficient Picture-Rate Up-Converter
Journal of Signal Processing Systems, 2005
On Design of Parallel Memory Access Schemes for Video Coding
Journal of Signal Processing Systems, 2005
Application specific instruction-set processor template for motion estimation in video applications
IEEE Transactions on Circuits and Systems for Video Technology, 2005
Tackling occlusion in scan rate conversion systems
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A programmable co-porcessor for MPEG-4 video
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
An efficient buffer memory system for subarray access
IEEE Transactions on Parallel and Distributed Systems, 2001
A 1.3-GOPS parallel DSP for high-performance image-processing applications
IEEE Journal of Solid-State Circuits, 2000
System on silicon-IC for motion compensated scan rate conversion picture-in-picture processing, split screen applications and display processing
IEEE Transactions on Consumer Electronics, 1999
Access and Alignment of Data in an Array Processor
IEEE Transactions on Computers, 1975
The Organization and Use of Parallel Memories
IEEE Transactions on Computers, 1971