Partial-match retrieval using hashing and descriptors
- 1 December 1983
- journal article
- Published by Association for Computing Machinery (ACM) in ACM Transactions on Database Systems
- Vol. 8 (4) , 552-576
- https://doi.org/10.1145/319996.320006
Abstract
This paper studies a partial-match retrieval scheme based on hash functions and descriptors. The emphasis is placed on showing how the use of a descriptor file can improve the performance of the scheme. Records in the file are given addresses according to hash functions for each field in the record. Furthermore, each page of the file has associated with it a descriptor, which is a fixed-length bit string, determined by the records actually present in the page. Before a page is accessed to see if it contains records in the answer to a query, the descriptor for the page is checked. This check may show that no relevant records are on the page and, hence, that the page does not have to be accessed. The method is shown to have a very substantial performance advantage over pure hashing schemes, when some fields in the records have large key spaces. A mathematical model of the scheme, plus an algorithm for optimizing performance, is given.Keywords
This publication has 11 references indexed in Scilit:
- Dynamic Hashing SchemesThe Computer Journal, 1982
- Partial-match retrieval for dynamic filesBIT Numerical Mathematics, 1982
- Optimal partial-match retrievalBIT Numerical Mathematics, 1980
- Partial-match retrieval using indexed descriptor filesCommunications of the ACM, 1980
- Extendible hashing—a fast access method for dynamic filesACM Transactions on Database Systems, 1979
- Optimal partial-match retrieval when fields are independently specifiedACM Transactions on Database Systems, 1979
- Optimality Properties of Multiple-Key Hashing FunctionsJournal of the ACM, 1979
- Partial-match retrieval via the method of superimposed codesProceedings of the IEEE, 1979
- Partial-Match Retrieval AlgorithmsSIAM Journal on Computing, 1976
- Attribute based file organization in a paged memory environmentCommunications of the ACM, 1974