Evolutionary Origins of Genomic Repertoires in Bacteria
Open Access
- 5 April 2005
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Biology
- Vol. 3 (5) , e130
- https://doi.org/10.1371/journal.pbio.0030130
Abstract
Explaining the diversity of gene repertoires has been a major problem in modern evolutionary biology. In eukaryotes, this diversity is believed to result mainly from gene duplication and loss, but in prokaryotes, lateral gene transfer (LGT) can also contribute substantially to genome contents. To determine the histories of gene inventories, we conducted an exhaustive analysis of gene phylogenies for all gene families in a widely sampled group, the γ-Proteobacteria. We show that, although these bacterial genomes display striking differences in gene repertoires, most gene families having representatives in several species have congruent histories. Other than the few vast multigene families, gene duplication has contributed relatively little to the contents of these genomes; instead, LGT, over time, provides most of the diversity in genomic repertoires. Most such acquired genes are lost, but the majority of those that persist in genomes are transmitted strictly vertically. Although our analyses are limited to the γ-Proteobacteria, these results resolve a long-standing paradox—i.e., the ability to make robust phylogenetic inferences in light of substantial LGT.Keywords
This publication has 59 references indexed in Scilit:
- Comparative genomics, minimal gene-sets and the last universal common ancestorNature Reviews Microbiology, 2003
- Evolution by gene duplication: an updateTrends in Ecology & Evolution, 2003
- The structure of the protein universe and genome evolutionNature, 2002
- Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidiaNature Genetics, 2002
- A Phylogenomic Approach to Bacterial Phylogeny: Evidence of a Core of Genes Sharing a Common HistoryGenome Research, 2002
- Comparison of the genomes of two Xanthomonas pathogens with differing host specificitiesNature, 2002
- Genome sequence of Yersinia pestis, the causative agent of plagueNature, 2001
- Multiple Comparisons of Log-Likelihoods with Applications to Phylogenetic InferenceMolecular Biology and Evolution, 1999
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992