Minimum cost selection of secondary indexes for formatted files
- 1 March 1977
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 2 (1), 68-90
- https://doi.org/10.1145/320521.320537
Abstract
Secondary indexes are often used in database management systems for secondary key retrieval. Although their use can improve retrieval time significantly, the cost of index maintenance and storage increases the overhead of the file processing application. The optimal set of indexed secondary keys for a particular application depends on a number of application dependent factors. In this paper a cost function is developed for the evaluation of candidate indexing choices and applied to the optimization of index selection. Factors accounted for include file size, the relative rates of retrieval and maintenance and the distribution of retrieval and maintenance over the candidate keys, index structure, and system charging rates. Among the results demonstrated are the increased effectiveness of secondary indexes for large files, the effect of the relative rates of retrieval and maintenance, the greater cost of allowing for arbitrarily formulated queries, and the impact on cost of the use of different index structures.
This publication has 6 references indexed in Scilit:
- Analysis and performance of inverted data base structures. Communications of the ACM, 1975
- Retrieval efficiency using combined indices. Published by Association for Computing Machinery (ACM), 1972
- Introduction to “feature analysis of generalized data base management systems”. Communications of the ACM, 1971
- An optimization problem on the selection of secondary keys. Published by Association for Computing Machinery (ACM), 1971
- Multi-attribute retrieval with combined indexes. Communications of the ACM, 1970
- The Influence of Data Base Characteristics and Usage on Direct Access File Organization. Journal of the ACM, 1968