Optimal multi-step k-nearest neighbor search
- 1 June 1998
- journal article
- Published by Association for Computing Machinery (ACM) in ACM SIGMOD Record
- Vol. 27 (2) , 154-165
- https://doi.org/10.1145/276305.276319
Abstract
For an increasing number of modern database applications, efficient support of similarity search becomes an important task. Along with the complexity of the objects such as images, molecules and mechanical parts, also the complexity of the similarity models increases more and more. Whereas algorithms that are directly based on indexes work well for simple medium-dimensional similarity distance functions, they do not meet the efficiency requirements of complex high-dimensional and adaptable distance functions. The use of a multi-step query processing strategy is recommended in these cases, and our investigations substantiate that the number of candidates which are produced in the filter step and exactly evaluated in the refinement step is a fundamental efficiency parameter. After revealing the strong performance shortcomings of the state-of-the-art algorithm for k -nearest neighbor search [Korn et al. 1996], we present a novel multi-step algorithm which is guaranteed to produce the minimum number of candidates. Experimental evaluations demonstrate the significant performance gain over the previous solution, and we observed average improvement factors of up to 120 for the number of candidates and up to 48 for the total runtime.Keywords
This publication has 18 references indexed in Scilit:
- Multidimensional access methodsACM Computing Surveys, 1998
- Efficient color histogram indexing for quadratic form distance functionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1995
- Accounting for boundary effects in nearest neighbor searchingPublished by Association for Computing Machinery (ACM) ,1995
- Efficient and effective Querying by Image ContentJournal of Intelligent Information Systems, 1994
- Similar shape retrieval using a structural feature indexInformation Systems, 1993
- Searching for geometric molecular shape complementarity using bidimensional surface profilesJournal of Molecular Graphics, 1992
- Fast K-dimensional tree algorithms for nearest neighbor search with application to vector quantization encodingIEEE Transactions on Signal Processing, 1992
- Refinements to nearest-neighbor searching ink-dimensional treesAlgorithmica, 1991
- PROBE spatial data modeling and query processing in an image database applicationIEEE Transactions on Software Engineering, 1988
- An Algorithm for Finding Best Matches in Logarithmic Expected TimeACM Transactions on Mathematical Software, 1977