OPTIMIZATION AND PROFILING OF THE CACHE PERFORMANCE OF PARALLEL LATTICE BOLTZMANN CODES
- 1 December 2003
- journal article
- research article
- Published by World Scientific Pub Co Pte Ltd in Parallel Processing Letters
- Vol. 13 (04) , 549-560
- https://doi.org/10.1142/s0129626403001501
Abstract
When designing and implementing highly efficient scientific applications for parallel computers such as clusters of workstations, it is inevitable to consider and to optimize the single-CPU performance of the codes. For this purpose, it is particularly important that the codes respect the hierarchical memory designs that computer architects employ in order to hide the effects of the growing gap between CPU performance and main memory speed. In this article, we present techniques to enhance the single-CPU efficiency of lattice Boltzmann methods which are commonly used in computational fluid dynamics. We show various performance results for both 2D and 3D codes in order to emphasize the effectiveness of our optimization techniques.Keywords
This publication has 8 references indexed in Scilit:
- An Overview of Cache Optimization Techniques and Cache-Aware Numerical AlgorithmsPublished by Springer Nature ,2003
- Cache Performance Optimizations for Parallel Lattice Boltzmann CodesPublished by Springer Nature ,2003
- FFTW: an adaptive software architecture for the FFTPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Performance Optimization of Numerically Intensive CodesPublished by Society for Industrial & Applied Mathematics (SIAM) ,2001
- A Portable Programming Interface for Performance Evaluation on Modern ProcessorsThe International Journal of High Performance Computing Applications, 2000
- Lattice Gas Cellular Automata and Lattice Boltzmann ModelsLecture Notes in Mathematics, 2000
- LATTICE BOLTZMANN METHOD FOR FLUID FLOWSAnnual Review of Fluid Mechanics, 1998
- Numerical Simulation in Fluid DynamicsPublished by Society for Industrial & Applied Mathematics (SIAM) ,1998