More Genes or More Taxa? The Relative Contribution of Gene Number and Taxon Number to Phylogenetic Accuracy
- 2 March 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 22 (5) , 1337-1344
- https://doi.org/10.1093/molbev/msi121
Abstract
The relative contribution of taxon number and gene number to accuracy in phylogenetic inference is a major issue in phylogenetics and of central importance to the choice of experimental strategies for the successful reconstruction of a broad sketch of the tree of life. Maximization of the number of taxa sampled is the strategy favored by most phylogeneticists, although its necessity remains the subject of debate. Vast increases in gene number are now possible due to advances in genomics, but large numbers of genes will be available for only modest numbers of taxa, raising the question of whether such genome-scale phylogenies will be robust to the addition of taxa. To examine the relative benefit of increasing taxon number or gene number to phylogenetic accuracy, we have developed an assay that utilizes the symmetric difference tree distance as a measure of phylogenetic accuracy. We have applied this assay to a genome-scale data matrix containing 106 genes from 14 yeast species. Our results show that increasing taxon number correlates with a slight decrease in phylogenetic accuracy. In contrast, increasing gene number has a significant positive effect on phylogenetic accuracy. Analyses of an additional taxon-rich data matrix from the same yeast clade show that taxon number does not have a significant effect on phylogenetic accuracy. The positive effect of gene number and the lack of effect of taxon number on phylogenetic accuracy are also corroborated by analyses of two data matrices from mammals and angiosperm plants, respectively. We conclude that, for typical data sets, the number of genes utilized may be a more important determinant of phylogenetic accuracy than taxon number.Keywords
This publication has 53 references indexed in Scilit:
- Genome evolution in yeastsNature, 2004
- The Ashbya gossypii Genome as a Tool for Mapping the Ancient Saccharomyces cerevisiae GenomeScience, 2004
- Finding Functional Features in Saccharomyces Genomes by Phylogenetic FootprintingScience, 2003
- Faculty of 1000 evaluation for The evolutionary position of nematodes.BMC Ecology and Evolution, 2002
- The analysis of 100 genes supports the grouping of three highly divergent amoebae: Dictyostelium , Entamoeba , and MastigamoebaProceedings of the National Academy of Sciences, 2002
- A Kingdom-Level Phylogeny of Eukaryotes Based on Combined Protein DataScience, 2000
- A few logs suffice to build (almost) all trees: Part IITheoretical Computer Science, 1999
- A few logs suffice to build (almost) all trees (I)Random Structures & Algorithms, 1999
- Do long branches attract flies?Nature, 1995
- Cases in which Parsimony or Compatibility Methods Will be Positively MisleadingSystematic Zoology, 1978