Neighborhood Preserving Hashing and Approximate Queries
- 1 January 2001
- journal article
- Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Discrete Mathematics
- Vol. 15 (1) , 73-85
- https://doi.org/10.1137/s089548019731809x
Abstract
Let $D \subseteq \Sigma^n$ be a dictionary. We look for efficient data structures and algorithms to solve the following approximate query problem: Given a query $u \in \Sigma^n$ list all words $v \in D$ that are close to u in Hamming distance.The problem reduces to the following combinatorial problem: Hash the vertices of the n-dimensional hypercube into buckets so that (1) the c-neighborhood of each vertex is mapped into at most k buckets and (2) no bucket is too large.Lower and upper bounds are given for the tradeoff between k and the size of the largest bucket. These results are used to derive bounds for the approximate query problem.
Keywords
This publication has 3 references indexed in Scilit:
- Covering radius---Survey and recent resultsIEEE Transactions on Information Theory, 1985
- A survey of perfect codesRocky Mountain Journal of Mathematics, 1975
- Nearly perfect binary codesDiscrete Mathematics, 1972