Efficient Biased Estimation of Evolutionary Distances When Substitution Rates Vary Across Sites
Open Access
- 1 April 2002
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 19 (4) , 534-543
- https://doi.org/10.1093/oxfordjournals.molbev.a004109
Abstract
This paper deals with phylogenetic inference when the variability of substitution rates across sites (VRAS) is modeled by a gamma distribution. We show that underestimating VRAS, which results in underestimates for the evolutionary distances between sequences, usually improves the topological accuracy of phylogenetic tree inference by distance-based methods, especially when the molecular clock holds. We propose a method to estimate the gamma shape parameter value which is most suited for tree topology inference, given the sequences at hand. This method is based on the pairwise evolutionary distances between sequences and allows one to reconstruct the phylogeny of a high number of taxa (>1,000). Simulation results show that the topological accuracy is highly improved when using the gamma shape parameter value given by our method, compared with the true (unknown) value which was used to generate the data. Furthermore, when VRAS is high, the topological accuracy of our distance-based method is better than that of a maximum likelihood approach. Finally, a data set of Maoricicada species sequences is analyzed, which confirms the advantage of our method.
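The link the abstract draws between underestimating VRAS and underestimating distances can be illustrated with a minimal sketch. It assumes the Jukes-Cantor (JC69) substitution model with gamma-distributed rates, which the abstract does not name; the function names and the example value of p are hypothetical and chosen only for illustration. Under that assumption, the gamma-corrected distance for an observed proportion of differing sites p and shape parameter alpha is d = (3/4) * alpha * [(1 - 4p/3)^(-1/alpha) - 1]. A larger alpha means less rate variation, so assuming an alpha above the true value (underestimating VRAS) yields a smaller distance. This is not the paper's procedure for choosing the shape parameter, only the distance formula that such a choice would feed into.

```python
import math


def jc69_gamma_distance(p: float, alpha: float) -> float:
    """Gamma-corrected JC69 distance (assumed model, not stated in the abstract).

    p     : observed proportion of differing sites between two sequences
    alpha : gamma shape parameter (small alpha = high rate variation)
    """
    if not 0.0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75) for the JC69 correction")
    if alpha <= 0.0:
        raise ValueError("alpha must be positive")
    return 0.75 * alpha * ((1.0 - 4.0 * p / 3.0) ** (-1.0 / alpha) - 1.0)


def jc69_distance(p: float) -> float:
    """Plain JC69 distance: the limit of the gamma form as alpha grows (no VRAS)."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must lie in [0, 0.75) for the JC69 correction")
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


if __name__ == "__main__":
    p = 0.3  # hypothetical observed divergence
    # Increasing alpha (assuming less rate variation) shrinks the estimate.
    for alpha in (0.25, 0.5, 1.0, 2.0, 10.0):
        print(f"alpha={alpha:>5}: d={jc69_gamma_distance(p, alpha):.4f}")
    print(f"no VRAS  : d={jc69_distance(p):.4f}")
```

Running the script prints distances that decrease monotonically as alpha increases and approach the uncorrected JC69 value, which is the direction of bias the abstract describes.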