Improving the Practical Space and Time Efficiency of the Shortest-Paths Approach to Sum-of-Pairs Multiple Sequence Alignment
- 1 January 1995
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 2 (3) , 459-472
- https://doi.org/10.1089/cmb.1995.2.459
Abstract
The MSA program, written and distributed in 1989, is one of the few existing programs that attempts to find optimal alignments of multiple protein or DNA sequences. The MSA program implements a branch-and-bound technique together with a variant of Dijkstra's shortest paths algorithm to prune the basic dynamic programming graph. We have made substantial improvements in the time and space usage of MSA. The improvements make feasible a variety of problem instances that were not feasible previously. On some runs we achieve an order of magnitude reduction in space usage and a significant multiplicative factor speedup in running time. To explain how these improvements work, we give a much more detailed description of MSA than has been previously available. In practice, MSA rarely produces a provably optimal alignment and we explain why.Keywords
This publication has 12 references indexed in Scilit:
- The parameterized complexity of sequence alignment and consensusPublished by Springer Nature ,1994
- A survey of multiple sequence comparison methodsBulletin of Mathematical Biology, 1992
- Weights for data related by a treeJournal of Molecular Biology, 1989
- A tool for multiple sequence alignment.Proceedings of the National Academy of Sciences, 1989
- Gap costs for multiple sequence alignmentJournal of Theoretical Biology, 1989
- The Multiple Sequence Alignment Problem in BiologySIAM Journal on Applied Mathematics, 1988
- Progressive sequence alignment as a prerequisitetto correct phylogenetic treesJournal of Molecular Evolution, 1987
- Evaluation and improvements in the automatic alignment of protein sequencesProtein Engineering, Design and Selection, 1987
- The Complexity of Some Problems on Subsequences and SupersequencesJournal of the ACM, 1978
- A note on two problems in connexion with graphsNumerische Mathematik, 1959