Index scans using a finite LRU buffer: a validated I/O model
- 1 September 1989
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 14 (3) , 401-424
- https://doi.org/10.1145/68012.68016
Abstract
Indexes are commonly employed to retrieve a portion of a file or to retrieve its records in a particular order. An accurate performance model of indexes is essential to the design, analysis, and tuning of file management and database systems, and particularly to database query optimization. Many previous studies have addressed the problem of estimating the number of disk page fetches when randomly accessing k records out of N given records stored on T disk pages. This paper generalizes these results, relaxing two assumptions that usually do not hold in practice: unlimited buffer and unique records for each key value. Experiments show that the performance of an index scan is very sensitive to buffer size limitations and multiple records per key value. A model for these more practical situations is presented and a formula derived for estimating the performance of an index scan. We also give a closed-form approximation that is easy to compute. The theoretical results are validated using the R * distributed relational database system. Although we use database terminology throughout the paper, the model is more generally applicable whenever random accesses are made using keys.Keywords
This publication has 15 references indexed in Scilit:
- Estimating block selectivitiesInformation Systems, 1984
- Principles of database buffer managementACM Transactions on Database Systems, 1984
- Implications of certain assumptions in database performance evauationACM Transactions on Database Systems, 1984
- On estimating block accesses in database organizationsCommunications of the ACM, 1983
- Estimating block accesses in database organizationsCommunications of the ACM, 1983
- Duplicate record elimination in large data filesACM Transactions on Database Systems, 1983
- Estimating block accesses and number of records in file managementCommunications of the ACM, 1982
- Operating system support for database managementCommunications of the ACM, 1981
- Approximating block accesses in database organizationsCommunications of the ACM, 1977
- Analysis and performance of inverted data base structuresCommunications of the ACM, 1975