How Many Genes Should a Systematist Sample? Conflicting Insights from a Phylogenomic Matrix Characterized by Replicated Incongruence
Open Access
- 1 April 2007
- journal article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 56 (2) , 355-363
- https://doi.org/10.1080/10635150701294733
Abstract
The average size of molecular systematic data sets has grown steadily over the past 20 years. Combined phylogenetic matrices that include multiple genetic loci currently are the norm, and in many cases, rapid compilation of extremely large DNA data sets is feasible. Thus, a frequently asked question is “How many genes should a systematist sequence in order to generate a robust phylogenetic hypothesis?” This query generally has been addressed by computer simulation, where the amount of virtual DNA sequence data that can be generated is unlimited (e.g., Huelsenbeck and Hillis, 1993). Genomic data, however, provide systematists with a multitude of empirical molecular data for phylogenetic analysis, and several authors have taken advantage of this resource to examine the effects of increasing the number of genes to quantities that seemed impossible in the recent past (e.g., Cummings et al., 1995; Bapteste et al., 2002; Goremykin, 2004).Keywords
This publication has 46 references indexed in Scilit:
- Supertree Bootstrapping Methods for Assessing Phylogenetic Variation among Genes in Genome-Scale Data SetsSystematic Biology, 2006
- Magic bullets and golden rules: Data sampling in molecular phylogeneticsZoology, 2005
- Choosing the Best Genes for the Job: The Case for Stationary Genes in Genome-Scale PhylogeneticsSystematic Biology, 2005
- A review of long‐branch attractionCladistics, 2005
- Should we be worried about long-branch attraction in real data sets? Investigations using metazoan 18S rDNAMolecular Phylogenetics and Evolution, 2004
- The analysis of 100 genes supports the grouping of three highly divergent amoebae: Dictyostelium , Entamoeba , and MastigamoebaProceedings of the National Academy of Sciences, 2002
- Morphology, Fossils, Divergence Timing, and the Phylogenetic Relationships of GavialisSystematic Biology, 1997
- Gene Trees and Species Trees: Molecular Systematics as One-Character TaxonomySystematic Botany, 1992
- CONFIDENCE LIMITS ON PHYLOGENIES: AN APPROACH USING THE BOOTSTRAPEvolution, 1985
- Cases in which Parsimony or Compatibility Methods Will be Positively MisleadingSystematic Zoology, 1978