Alignment and Topological Accuracy of the Direct Optimization approach via POY and Traditional Phylogenetics via ClustalW + PAUP*
Open Access
- 1 April 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 56 (2) , 182-193
- https://doi.org/10.1080/10635150701281102
Abstract
Direct optimization frameworks for simultaneously estimating alignments and phylogenies have recently been developed. One such method, implemented in the program POY, is becoming more common for analyses of variable length sequences (e.g., analyses using ribosomal genes) and for combined evidence analyses (morphology + multiple genes). Simulation of sequences containing insertion and deletion events was performed in order to directly compare a widely used method of multiple sequence alignment (ClustalW) and subsequent parsimony analysis in PAUP* with direct optimization via POY. Data sets were simulated for pectinate, balanced, and random tree shapes under different conditions (clocklike, non-clocklike, and ultrametric). Alignment accuracy scores for the implied alignments from POY and the multiple sequence alignments from ClustalW were calculated and compared. In almost all cases (99.95%), ClustalW produced more accurate alignments than POY-implied alignments, judged by the proportion of correctly identified homologous sites. Topological accuracy (distance to the true tree) for POY topologies and topologies generated under parsimony in PAUP* from the ClustalW alignments were also compared. In 44.94% of the cases, Clustal alignment tree reconstructions via PAUP* were more accurate than POY, whereas in 16.71% of the cases POY reconstructions were more topologically accurate (38.38% of the time they were equally accurate). Comparisons between POY hypothesized alignments and the true alignments indicated that, on average, as alignment error increased, topological accuracy decreased.Keywords
This publication has 58 references indexed in Scilit:
- Simultaneous Statistical Multiple Alignment and Phylogeny ReconstructionSystematic Biology, 2005
- Generating implied alignments under direct optimization using POYCladistics, 2005
- Sequence length variation, indel costs, and congruence in sensitivity analysisCladistics, 2005
- Comparison of the Accuracies of Several Phylogenetic Methods Using Protein and DNA SequencesMolecular Biology and Evolution, 2004
- Reconstructing large regions of an ancestral mammalian genome in silicoGenome Research, 2004
- Aligning Multiple Genomic Sequences With the Threaded Blockset AlignerGenome Research, 2004
- Philosophy and Phylogenetic Inference: A Comparison of Likelihood and Parsimony Methods in the Context of Karl Popper's Writings on CorroborationSystematic Biology, 2001
- Exploring the Behavior of POY, a Program for Direct Optimization of Molecular DataCladistics, 2001
- The Archaea Monophyly Issue: A Phylogeny of Translational Elongation Factor G(2) Sequences Inferred from an Optimized Selection of Alignment PositionsJournal of Molecular Evolution, 1999
- CONCEPTS AND TESTS OF HOMOLOGY IN THE CLADISTIC PARADIGMCladistics, 1991