Abstract
In recent years, scientific computing is being driven by microprocessor-based architectures. Most architectural designs are characterized by fast processors, fast but small caches, and large but slow memories. As a result, problems of small sizes that fit in cache perform exceedingly well, whereas the performance of larger problems is limited by the speed of memory. In this paper, the authors study the performance characteristics of several iterative kernels for solving sparse linear systems on several popular microprocessors. Given the performance limitations posed by slow memory on large problem sizes, the authors show the effectiveness of using domain decomposition methods of the additive Schwarz type to enhance performance on single microprocessors.

This publication has 15 references indexed in Scilit: