Fast data-locality profiling of native execution
- 6 June 2005
- proceedings article
- Published by Association for Computing Machinery (ACM)
- Vol. 33 (1) , 169-180
- https://doi.org/10.1145/1064212.1064232
Abstract
Performance tools based on hardware counters can efficiently profile the cache behavior of an application and help software developers improve its cache utilization. Simulator-based tools can potentially provide more insights and flexibility and model many different cache configurations, but have the drawback of large run-time overhead.We present StatCache, a performance tool based on a statistical cache model. It has a small run-time overhead while providing much of the flexibility of simulator-based tools. A monitor process running in the background collects sparse memory access statistics about the analyzed application running natively on a host computer. Generic locality information is derived and presented in a code-centric and/or data-centric view.We evaluate the accuracy and performance of the tool using ten SPEC CPU2000 benchmarks. We also exemplify how the flexibility of the tool can be used to better understand the characteristics of cache-related performance problems.Keywords
This publication has 22 references indexed in Scilit:
- Estimating cache misses and locality using stack distancesPublished by Association for Computing Machinery (ACM) ,2003
- Cache miss equationsACM Transactions on Programming Languages and Systems, 1999
- Combining trace sampling with single pass methods for efficient cache simulationIEEE Transactions on Computers, 1998
- Continuous profilingACM Transactions on Computer Systems, 1997
- Using the SimOS machine simulator to study complex computer systemsACM Transactions on Modeling and Computer Simulation, 1997
- Tuning memory performance of sequential and parallel programsComputer, 1995
- Trap-driven simulation with Tapeworm IIPublished by Association for Computing Machinery (ACM) ,1994
- A comparison of trace-sampling techniques for multi-megabyte cachesIEEE Transactions on Computers, 1994
- A model for estimating trace-sample miss ratiosACM SIGMETRICS Performance Evaluation Review, 1991
- Accurate low-cost methods for performance evaluation of cache memory systemsIEEE Transactions on Computers, 1988