Neighborhood Preserving Hashing and Approximate Queries

1 January 2001

journal article
Published by Society for Industrial & Applied Mathematics (SIAM) in SIAM Journal on Discrete Mathematics

Vol. 15 (1) , 73-85
https://doi.org/10.1137/s089548019731809x

Abstract

Let $D \subseteq \Sigma^n$ be a dictionary. We look for efficient data structures and algorithms to solve the following approximate query problem: Given a query $u \in \Sigma^n$ list all words $v \in D$ that are close to u in Hamming distance.The problem reduces to the following combinatorial problem: Hash the vertices of the n-dimensional hypercube into buckets so that (1) the c-neighborhood of each vertex is mapped into at most k buckets and (2) no bucket is too large.Lower and upper bounds are given for the tradeoff between k and the size of the largest bucket. These results are used to derive bounds for the approximate query problem.

Keywords

This publication has 3 references indexed in Scilit:

Covering radius---Survey and recent results
IEEE Transactions on Information Theory, 1985
A survey of perfect codes
Rocky Mountain Journal of Mathematics, 1975
Nearly perfect binary codes
Discrete Mathematics, 1972