Reconstructing evolutionary trees from DNA and proteinsequences: paralinear distances.
- 15 February 1994
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 91 (4) , 1455-1459
- https://doi.org/10.1073/pnas.91.4.1455
Abstract
The reconstruction of phylogenetic trees from DNA and protein sequences is confounded by unequal rate effects. These effects can group rapidly evolving taxa with other rapidly evolving taxa, whether or not they are genealogically related. All algorithms are sensitive to these effects whenever the assumptions on which they are based are not met. The algorithm presented here, called paralinear distances, is valid for a much broader class of substitution processes than previous algorithms and is accordingly less affected by unequal rate effects. It may be used with all nucleic acid, protein, or other sequences, provided that their evolution may be modeled as a succession of Markov processes. The properties of the method have been proven both analytically and by computer simulations. Like all other methods, paralinear distances can fail when sequences are misaligned or when site-to-site sequence variation of rates is extensive. To examine the usefulness of paralinear distances, the "origin of the eukaryotes" has been investigated by the analysis of elongation factor Tu sequences with a variety of sequence alignments. It has been found that the order in which sequences are pairwise aligned strongly determines the topology which is reconstructed by paralinear distances (as it does for all other reconstruction methods tested). When the parts of the alignment that are unaffected by alignment order are analyzed, paralinear distances strongly select the eocyte topology. This provides evidence that the eocyte prokaryotes are the closest prokaryotic relatives of the eukaryotes.Keywords
This publication has 15 references indexed in Scilit:
- Confidence in evolutionary trees from biological sequence dataNature, 1993
- The sequence of the gene encoding elongation factor Tu from Chlamydia trachomatis compared with those of other organismsGene, 1992
- Evidence that eukaryotes and eocyte prokaryotes are immediate relativesScience, 1992
- [27] Dynamic programming algorithms for biological sequence comparisonPublished by Elsevier ,1992
- Functional implications related to the gene structure of the elongation factor EF-Tu fromHalobacterium marismortuiNucleic Acids Research, 1990
- Origin of the eukaryotic nucleus determined by rate-invariant analysis of rRNA sequencesNature, 1988
- Optimal sequence alignmentsProceedings of the National Academy of Sciences, 1983
- [47] Establishing homologies in protein sequencesPublished by Elsevier ,1983
- The nucleotide sequence of the cloned tufA gene of Escherichia coliGene, 1980
- The evolution of the globin family genes: Concordance of stochastic and augmented maximum parsimony genetic distances for α hemoglobin, β hemoglobin and myoglobin phylogeniesJournal of Molecular Biology, 1976