Exploiting mixed-mode parallelism for matrix operations on the HERA architecture through reconfiguration
- 1 January 2006
- journal article
- Published by Institution of Engineering and Technology (IET) in IEE Proceedings - Computers and Digital Techniques
- Vol. 153 (4) , 249-260
- https://doi.org/10.1049/ip-cdt:20045136
Abstract
Recent advances in multi-million-gate platform field-programmable gate arrays (FPGAs) have made it possible to design and implement complex parallel systems on a programmable chip that also incorporate hardware floating-point units (FPUs). These options take advantage of resource reconfiguration. In contrast to the majority of the FPGA community that still employs reconfigurable logic to develop algorithm-specific circuitry, our FPGA-based mixed-mode reconfigurable computing machine can implement simultaneously a variety of parallel execution modes and is also user programmable. Our heterogeneous reconfigurable architecture (HERA) machine can implement the single-instruction, multiple-data (SIMD), multiple-instruction, multiple-data (MIMD) and multiple-SIMD (M-SIMD) execution modes. Each processing element (PE) is centred on a single-precision IEEE 754 FPU with tightly-coupled local memory, and supports dynamic switching between SIMD and MIMD at runtime. Mixed-mode parallelism has the potential to best match the characteristics of all subtasks in applications, thus resulting in sustained high performance. HERA's performance is evaluated by two common computation-intensive testbenches: matrix–matrix multiplication (MMM) and LU factorisation of sparse doubly-bordered-block-diagonal (DBBD) matrices. Experimental results with electrical power network matrices show that the mixed-mode scheduling for LU factorisation can result in speedups of about 19% and 15.5% compared to the SIMD and MIMD implementations, respectively.Keywords
This publication has 15 references indexed in Scilit:
- Hierarchical synthesis of complex DSP functions on FPGAsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- An FPGA based coprocessor for large matrix product implementationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- The Vector-Thread ArchitectureACM SIGARCH Computer Architecture News, 2004
- Parallel LU factorization of sparse matrices on FPGA‐based configurable computing enginesConcurrency and Computation: Practice and Experience, 2004
- Evolutionary algorithm for the promotion of evolvable hardwareIEE Proceedings - Computers and Digital Techniques, 2004
- Reconfigurable computingACM Computing Surveys, 2002
- Coming challenges in microarchitecture and architectureProceedings of the IEEE, 2001
- Are single-chip multiprocessors in reach?IEEE Design & Test of Computers, 2001
- Reconfigurable Computing for Digital Signal Processing: A SurveyJournal of Signal Processing Systems, 2001
- MorphoSys: an integrated reconfigurable system for data-parallel and computation-intensive applicationsIEEE Transactions on Computers, 2000