Phylogenetic discovery bias in Bacillus anthracis using single-nucleotide polymorphisms from whole-genome sequencing
- 3 September 2004
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 101 (37), 13536-13541
- https://doi.org/10.1073/pnas.0403844101
Abstract
Phylogenetic reconstruction using molecular data is often subject to homoplasy, leading to inaccurate conclusions about phylogenetic relationships among operational taxonomic units. Compared with other molecular markers, single-nucleotide polymorphisms (SNPs) exhibit extremely low mutation rates, making them rare in recently emerged pathogens, but they are less prone to homoplasy and thus extremely valuable for phylogenetic analyses. Despite their phylogenetic potential, ascertainment bias occurs when SNP characters are discovered through biased taxonomic sampling; by using whole-genome comparisons of five diverse strains of Bacillus anthracis to facilitate SNP discovery, we show that only polymorphisms lying along the evolutionary pathway between reference strains will be observed. We illustrate this in theoretical and simulated data sets in which complex phylogenetic topologies are reduced to linear evolutionary models. Using a set of 990 SNP markers, we also show how divergent branches in our topologies collapse to single points but provide accurate information on internodal distances and points of origin for ancestral clades. These data allowed us to determine the ancestral root of B. anthracis, showing that it lies closer to a newly described “C” branch than to either of two previously described “A” or “B” branches. In addition, subclade rooting of the C branch revealed unequal evolutionary rates that seem to be correlated with ecological parameters and strain attributes. Our use of nonhomoplastic whole-genome SNP characters allows branch points and clade membership to be estimated with great precision, providing greater insight into epidemiological, ecological, and forensic questions.
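The discovery bias described in the abstract can be illustrated with a minimal sketch. Everything in it is a hypothetical assumption for illustration only: the tree shape, the strain names A to D, the internal nodes n1 and n2, and the SNP labels s1 to s11 are not the paper's data, and the two-reference comparison is a simplification of the five-genome comparison the authors describe. The sketch assumes no homoplasy, so each SNP arises exactly once on a single branch.

```python
# Hypothetical toy tree (not the paper's data).
# Each entry maps a node to (its parent, the SNPs that arose on the branch leading to it).
TREE = {
    "n1": ("root", {"s1", "s2"}),
    "A": ("n1", {"s3", "s4"}),
    "n2": ("n1", {"s5"}),
    "B": ("n2", {"s6", "s7"}),
    "C": ("n2", {"s8", "s9", "s10"}),
    "D": ("root", {"s11"}),
}


def derived_snps(node):
    """Collect every SNP on the path from the root down to `node`."""
    snps = set()
    while node != "root":
        parent, branch_snps = TREE[node]
        snps |= branch_snps
        node = parent
    return snps


def discover(ref_a, ref_b):
    """SNPs found by comparing two reference genomes: the sites where they differ."""
    return derived_snps(ref_a) ^ derived_snps(ref_b)


def profile(strain, panel):
    """Genotype a strain using only the SNPs in the discovered panel."""
    return derived_snps(strain) & panel


# Discover SNPs using A and B as the only sequenced reference strains.
panel = discover("A", "B")

# Strain C was not used for discovery, so its private SNPs are absent from the panel.
private_c_in_panel = TREE["C"][1] & panel

print("panel:", sorted(panel))
print("C profile:", sorted(profile("C", panel)))
print("n2 profile:", sorted(profile("n2", panel)))
print("C private SNPs in panel:", sorted(private_c_in_panel))
```

In this toy case the panel contains only the SNPs on the path between A and B. Strain C has the same panel profile as the internal node n2, so its whole branch collapses to the point where it leaves the A-to-B path, while that branching point itself is still placed correctly.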