Fast approximate string matching in a dictionary

27 November 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

p. 14-22
https://doi.org/10.1109/spire.1998.712978

Abstract

A successful technique to search large textual databases allowing errors relies on an online search in the vocabulary of the text. To reduce the time of that online search, we index the vocabulary as a metric space. We show that with reasonable space overhead we can improve by a factor of two over the fastest online algorithms, when the tolerated error level is low (which is reasonable in text searching).

Keywords

This publication has 10 references indexed in Scilit:

Block addressing indices for approximate text retrieval
Published by Association for Computing Machinery (ACM) ,1997
FastMap
Published by Association for Computing Machinery (ACM) ,1995
Fast text searching
Communications of the ACM, 1992
Satisfying general proximity / similarity queries with metric trees
Information Processing Letters, 1991
Fast string matching with k differences
Journal of Computer and System Sciences, 1988
An algorithm for finding nearest neighbours in (approximately) constant average time
Pattern Recognition Letters, 1986
Finding approximate patterns in strings
Journal of Algorithms, 1985
The theory and computation of evolutionary distances: Pattern recognition
Journal of Algorithms, 1980
The choice of reference points in best-match file searching
Communications of the ACM, 1977
Some approaches to best-match file searching
Communications of the ACM, 1973