From Gene Trees to Organismal Phylogeny in Prokaryotes:The Case of the γ-Proteobacteria
Top Cited Papers
Open Access
- 15 September 2003
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Biology
- Vol. 1 (1) , e19
- https://doi.org/10.1371/journal.pbio.0000019
Abstract
The rapid increase in published genomic sequences for bacteria presents the first opportunity to reconstruct evolutionary events on the scale of entire genomes. However, extensive lateral gene transfer (LGT) may thwart this goal by preventing the establishment of organismal relationships based on individual gene phylogenies. The group for which cases of LGT are most frequently documented and for which the greatest density of complete genome sequences is available is the γ-Proteobacteria, an ecologically diverse and ancient group including free-living species as well as pathogens and intracellular symbionts of plants and animals. We propose an approach to multigene phylogeny using complete genomes and apply it to the case of the γ-Proteobacteria. We first applied stringent criteria to identify a set of likely gene orthologs and then tested the compatibilities of the resulting protein alignments with several phylogenetic hypotheses. Our results demonstrate phylogenetic concordance among virtually all (203 of 205) of the selected gene families, with each of the exceptions consistent with a single LGT event. The concatenated sequences of the concordant families yield a fully resolved phylogeny. This topology also received strong support in analyses aimed at excluding effects of heterogeneity in nucleotide base composition across lineages. Our analysis indicates that single-copy orthologous genes are resistant to horizontal transfer, even in ancient bacterial groups subject to high rates of LGT. This gene set can be identified and used to yield robust hypotheses for organismal phylogenies, thus establishing a foundation for reconstructing the evolutionary transitions, such as gene transfer, that underlie diversity in genome content and organization.Keywords
This publication has 61 references indexed in Scilit:
- Ancient horizontal gene transferNature Reviews Genetics, 2003
- Genome sequence of the endocellular obligate symbiont of tsetse flies, Wigglesworthia glossinidiaNature Genetics, 2002
- A Phylogenomic Approach to Bacterial Phylogeny: Evidence of a Core of Genes Sharing a Common HistoryGenome Research, 2002
- Comparison of the genomes of two Xanthomonas pathogens with differing host specificitiesNature, 2002
- Genome sequence of Yersinia pestis, the causative agent of plagueNature, 2001
- Multiple Comparisons of Log-Likelihoods with Applications to Phylogenetic InferenceMolecular Biology and Evolution, 1999
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992