The Impact of Multiple Protein Sequence Alignment on Phylogenetic Estimation
- 11 September 2009
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE/ACM Transactions on Computational Biology and Bioinformatics
- Vol. 8 (4) , 1108-1119
- https://doi.org/10.1109/tcbb.2009.68
Abstract
Multiple sequence alignment is typically the first step in estimating phylogenetic trees, with the assumption being that as alignments improve, so will phylogenetic reconstructions. Over the last decade or so, new multiple sequence alignment methods have been developed to improve comparative analyses of protein structure, but these new methods have not been typically used in phylogenetic analyses. In this paper, we report on a simulation study that we performed to evaluate the consequences of using these new multiple sequence alignment methods in terms of the resultant phylogenetic reconstruction. We find that while alignment accuracy is positively correlated with phylogenetic accuracy, the amount of improvement in phylogenetic estimation that results from an improved alignment can range from quite small to substantial. We observe that phylogenetic accuracy is most highly correlated with alignment accuracy when sequences are most difficult to align, and that variation in alignment accuracy can have little impact on phylogenetic accuracy when alignment error rates are generally low. We discuss these observations and implications for future work.Keywords
This publication has 47 references indexed in Scilit:
- Alignment Uncertainty and Genomic AnalysisScience, 2008
- THE EFFECT OF THE GUIDE TREE ON MULTIPLE SEQUENCE ALIGNMENTS AND SUBSEQUENT PHYLOGENETIC ANALYSESPacific Symposium on Biocomputing, 2007
- Improving progressive alignment for phylogeny reconstruction using parsimonious guide-treesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2006
- Probalign: multiple sequence alignment using partition function posterior probabilitiesBioinformatics, 2006
- Exploring the Relationship between Sequence Similarity and Accurate Phylogenetic TreesMolecular Biology and Evolution, 2006
- Multiple sequence alignmentCurrent Opinion in Structural Biology, 2006
- Comparison of the Accuracies of Several Phylogenetic Methods Using Protein and DNA SequencesMolecular Biology and Evolution, 2004
- OXBench: A benchmark for evaluation of protein multiple sequence alignment accuracyBMC Bioinformatics, 2003
- Effects of nucleotide sequence alignment on phylogeny estimation: a case study of 18S rDNAs of apicomplexaMolecular Biology and Evolution, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994