A Table-Driven, Full-Sensitivity Similarity Search Algorithm

1 April 2003

journal article
research article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 10 (2) , 103-117
https://doi.org/10.1089/106652703321825919

Abstract

Searching a database for a local alignment to a query under a typical scoring scheme, such as PAM120 or BLOSUM62 with affine gap costs, is a computation that has resisted algorithmic improvement due to its basis in dynamic programming and the weak nature of the signals being searched for. In a query preprocessing step, a set of tables can be built that permit one to (a) eliminate a large fraction of the dynamic programming matrix from consideration and (b) to compute several steps of the remainder with a single table lookup. While this result is not an asymptotic improvement over the original Smith–Waterman algorithm, its complexity is characterized in terms of some sparse features of the matrix and it yields the fastest software implementation to date for such searches.

Keywords

This publication has 12 references indexed in Scilit:

An improved algorithm for matching biological sequences
Published by Elsevier ,2004
A sublinear algorithm for approximate keyword searching
Algorithmica, 1994
Sublinear approximate string matching and biological applications
Algorithmica, 1994
Amino acid substitution matrices from protein blocks.
Proceedings of the National Academy of Sciences, 1992
Basic Local Alignment Search Tool
Journal of Molecular Biology, 1990
A space-efficient parallel sequence comparison algorithm for a message-passing multiprocessor
International Journal of Parallel Programming, 1989
P-NAC: A Systolic Array for Comparing Nucleic Acid Sequences
Computer, 1987
AnO(ND) difference algorithm and its variations
Algorithmica, 1986
Rapid and Sensitive Protein Similarity Searches
Science, 1985
[47] Establishing homologies in protein sequences
Published by Elsevier ,1983