An Integrated Memory Array Processor for Embedded Image Recognition Systems
- 2 April 2007
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Computers
- Vol. 56 (5) , 622-634
- https://doi.org/10.1109/tc.2007.1010
Abstract
Embedded processors for video image recognition in most cases not only need to address the conventional cost (die size and power) versus real-time performance issue, but must also maintain high flexibility due to the immense diversity of recognition targets, situations, and applications. This paper describes IMAP, a highly parallel SIMD linear processor and memory array architecture that addresses these trade-off requirements. By using parallel and systolic algorithmic techniques, but based on a simple linear array architecture, IMAP successfully exploits not only the straightforward per-image row data level parallelism (DLP), but also the inherent DLP of other memory access patterns frequently found in various image recognition tasks, while allowing programming to be done using an explicit parallel C language (1DC). We describe and evaluate IMAP-CE, one of the latest IMAP processors, integrating 128 100 MHz 8 bit 4-way VLIW PEs, 128 2 KByte RAMs, and one 16 bit RISC control processor onto a single chip. The PE instruction set is enhanced to support 1DC code. The die size of IMAP-CE is 11 times11 mm2 integrating 32.7 M transistors, while the power consumption is, on average, approximately 2 watts. IMAP-CE is evaluated mainly by comparing its performance while running 1DC code with that of a 2.4 GHz Intel P4 running optimized C code. Based on the use of parallelizing techniques, benchmark results show a speed increase of up to 20 times for image filter kernels and of 4 times for a full image recognition applicationKeywords
This publication has 30 references indexed in Scilit:
- Distance transformations in digital imagesPublished by Elsevier ,2006
- Implementation of a 4/sup th/-generation 1.8GHz dual-core sparc V9 microprocessorPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Fast parallel FFT on a reconfigurable computation platformPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Languages for massively parallel computersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Xetal: a low-power high-performance smart camera processorPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- The Scc Compiler: SWARing at MMX and 3D Now!Published by Springer Nature ,2000
- Image processing using one-dimensional processor arraysProceedings of the IEEE, 1996
- High performance fortranLecture Notes in Computer Science, 1994
- Steps toward architecture-independent image processingComputer, 1992
- A Computational Approach to Edge DetectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1986