Estimating Species Phylogenies Using Coalescence Times among Sequences
Open Access
- 16 July 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 58 (5) , 468-477
- https://doi.org/10.1093/sysbio/syp031
Abstract
The estimation of species trees (phylogenies) is one of the most important problems in evolutionary biology, and recently, there has been greater appreciation of the need to estimate species trees directly rather than using gene trees as a surrogate. A Bayesian method constructed under the multispecies coalescent model can consistently estimate species trees but involves intensive computation, which can hinder its application to the phylogenetic analysis of large-scale genomic data. Many summary statistics–based approaches, such as shallowest coalescences (SC) and Global LAteSt Split (GLASS), have been developed to infer species phylogenies for multilocus data sets. In this paper, we propose 2 methods, species tree estimation using average ranks of coalescences (STAR) and species tree estimation using average coalescence times (STEAC), based on the summary statistics of coalescence times. It can be shown that the 2 methods are statistically consistent under the multispecies coalescent model. STAR uses the ranks of coalescences and is thus resistant to variable substitution rates along the branches in gene trees. A simulation study suggests that STAR consistently outperforms STEAC, SC, and GLASS when the substitution rates among lineages are highly variable. Two real genomic data sets were analyzed by the 2 methods and produced species trees that are consistent with previous results.
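The abstract names the two summary statistics but does not spell out the computation, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes one sequence per species, gene trees given as nested tuples, a rank scheme in which the root receives rank n (the number of taxa) and each step toward the tips lowers the rank by 1, and UPGMA as a stand-in tree-building step chosen for brevity. All function names and the toy data are hypothetical.

```python
from itertools import combinations

def tips(tree):
    """List the tip labels of a nested-tuple tree."""
    return [tree] if isinstance(tree, str) else [t for c in tree for t in tips(c)]

def coalescence_ranks(tree):
    """Pairwise ranks of coalescences (assumed scheme: root = n, minus 1 per level)."""
    ranks = {}
    def walk(node, rank):
        if isinstance(node, str):
            return
        groups = [tips(child) for child in node]
        for g1, g2 in combinations(groups, 2):
            for x in g1:
                for y in g2:
                    ranks[frozenset((x, y))] = rank  # pair coalesces at this node
        for child in node:
            walk(child, rank - 1)
    walk(tree, len(tips(tree)))
    return ranks

def average_pairwise(matrices):
    """Average each pairwise value across loci."""
    return {p: sum(m[p] for m in matrices) / len(matrices) for p in matrices[0]}

def upgma(dist, taxa):
    """Build a nested-tuple topology by UPGMA (an assumption, used for brevity)."""
    size = {t: 1 for t in taxa}  # cluster -> number of tips
    d = dict(dist)
    while len(size) > 1:
        a, b = min(combinations(size, 2), key=lambda p: d[frozenset(p)])
        merged = (a, b)
        for c in size:
            if c not in (a, b):
                # Size-weighted mean distance from the merged cluster to c
                d[frozenset((merged, c))] = (
                    size[a] * d[frozenset((a, c))] + size[b] * d[frozenset((b, c))]
                ) / (size[a] + size[b])
        size[merged] = size[a] + size[b]
        del size[a], size[b]
    return next(iter(size))

taxa = ["A", "B", "C", "D"]

# Rank-based summary (STAR-like): only gene tree topologies are needed.
gene_trees = [
    ((("A", "B"), "C"), "D"),
    ((("A", "B"), "C"), "D"),
    ((("A", "C"), "B"), "D"),  # one discordant locus
]
star_tree = upgma(average_pairwise([coalescence_ranks(t) for t in gene_trees]), taxa)

# Time-based summary (STEAC-like): made-up pairwise coalescence times per locus.
raw_times = [
    {"AB": 1.0, "AC": 3.0, "BC": 3.0, "AD": 5.0, "BD": 5.0, "CD": 5.0},
    {"AB": 1.2, "AC": 2.8, "BC": 2.8, "AD": 5.5, "BD": 5.5, "CD": 5.5},
    {"AB": 3.2, "AC": 1.5, "BC": 3.2, "AD": 4.8, "BD": 4.8, "CD": 4.8},
]
times = [{frozenset(k): v for k, v in m.items()} for m in raw_times]
steac_tree = upgma(average_pairwise(times), taxa)

print(star_tree)
print(steac_tree)
```

In this toy data both summaries group A with B despite the discordant third locus. The rank-based version never reads branch lengths, which illustrates why a rank statistic would be insensitive to rate variation along gene tree branches.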