Dynamic Programming Alignment Accuracy
- 1 January 1998
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 5 (3) , 493-504
- https://doi.org/10.1089/cmb.1998.5.493
Abstract
Algorithms for generating alignments of biological sequences have inherent statistical limitations when it comes to the accuracy of the alignments they produce. Using simulations, we measure the accuracy of the standard global dynamic programming method and show that it can be reasonably well modelled by an "edge wander" approximation to the distribution of the optimal scoring path around the correct path in the vicinity of a gap. We also give a table from which accuracy values can be predicted for commonly used scoring schemes and sequence divergences (the PAM and BLOSUM series). Finally we describe how to calculate the expected accuracy of a given alignment, and show how this can be used to construct an optimal accuracy alignment algorithm which generates significantly more accurate alignments than standard dynamic programming methods in simulated experiments.Keywords
This publication has 10 references indexed in Scilit:
- Identification of common molecular subsequencesPublished by Elsevier ,2004
- Significant Improvement in Accuracy of Multiple Protein Sequence Alignments by Iterative Refinement as Assessed by Reference to Structural AlignmentsJournal of Molecular Biology, 1996
- Similarity Detection and LocalizationPhysical Review Letters, 1996
- Quantifying the local reliability of a sequence alignmentProtein Engineering, Design and Selection, 1996
- A reliable sequence alignment method based on probabilities of residue correspondencesProtein Engineering, Design and Selection, 1995
- Sequence alignment and penalty choiceJournal of Molecular Biology, 1994
- Inching toward reality: An improved likelihood model of sequence evolutionJournal of Molecular Evolution, 1992
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Improved tools for biological sequence comparison.Proceedings of the National Academy of Sciences, 1988
- A general method applicable to the search for similarities in the amino acid sequence of two proteinsJournal of Molecular Biology, 1970