Rsim: simulating shared-memory multiprocessors with ILP processors

7 August 2002

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in Computer

Vol. 35 (2) , 40-49
https://doi.org/10.1109/2.982915

Abstract

The early 1990s saw several announcements of commercial shared-memory systems using processors that aggressively exploited instruction-level parallelism (ILP), including the MIPS R10000, Hewlett-Packard PA8000, and Intel Pentium Pro. These processors could potentially reduce memory read stalls by overlapping read latency with other operations, possibly changing the nature of performance bottlenecks in the system. The authors' experience with Rsim demonstrates that modeling ILP features is important even in shared-memory multiprocessor systems. In particular, current simple processor-based approximations cannot model significant performance effects for applications exhibiting parallel read misses. Further, recent shared-memory designs such as aggressive implementations of sequential consistency use the aggressive ILP-enhancing features of modern processors that simple processor-based simulators do not model. As microprocessor systems become more complex, the availability of shared infrastructure source code is likely to become increasingly crucial. The authors plan to release a new Rsim version shortly that will include instruction caches, TLBs, multimedia extensions, simultaneous multithreading, Rabbit fast simulation mode, and ports to Linux platforms.

Keywords

This publication has 9 references indexed in Scilit:

Code transformations to improve memory parallelism
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Cross-profiling as an efficient technique in simulating parallel computer systems
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
The impact of instruction-level parallelism on multiprocessor performance and simulation methodology
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Comparing and combining read miss clustering and software prefetching
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Measuring Experimental Error in Microprocessor Simulation
Published by Association for Computing Machinery (ACM) ,2001
FLASH vs. (Simulated) FLASH
Published by Association for Computing Machinery (ACM) ,2000
Improving the accuracy vs. speed tradeoff for simulating shared-memory multiprocessors with ILP processors
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999
Fast out-of-order processor simulation using memoization
Published by Association for Computing Machinery (ACM) ,1998
Using the SimOS machine simulator to study complex computer systems
ACM Transactions on Modeling and Computer Simulation, 1997