A Table-Driven, Full-Sensitivity Similarity Search Algorithm
- 1 April 2003
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 10 (2) , 103-117
- https://doi.org/10.1089/106652703321825919
Abstract
Searching a database for a local alignment to a query under a typical scoring scheme, such as PAM120 or BLOSUM62 with affine gap costs, is a computation that has resisted algorithmic improvement due to its basis in dynamic programming and the weak nature of the signals being searched for. In a query preprocessing step, a set of tables can be built that permit one to (a) eliminate a large fraction of the dynamic programming matrix from consideration and (b) to compute several steps of the remainder with a single table lookup. While this result is not an asymptotic improvement over the original Smith–Waterman algorithm, its complexity is characterized in terms of some sparse features of the matrix and it yields the fastest software implementation to date for such searches.Keywords
This publication has 12 references indexed in Scilit:
- An improved algorithm for matching biological sequencesPublished by Elsevier ,2004
- A sublinear algorithm for approximate keyword searchingAlgorithmica, 1994
- Sublinear approximate string matching and biological applicationsAlgorithmica, 1994
- Amino acid substitution matrices from protein blocks.Proceedings of the National Academy of Sciences, 1992
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- A space-efficient parallel sequence comparison algorithm for a message-passing multiprocessorInternational Journal of Parallel Programming, 1989
- P-NAC: A Systolic Array for Comparing Nucleic Acid SequencesComputer, 1987
- AnO(ND) difference algorithm and its variationsAlgorithmica, 1986
- Rapid and Sensitive Protein Similarity SearchesScience, 1985
- [47] Establishing homologies in protein sequencesPublished by Elsevier ,1983