Streamlining and Large Ancestral Genomes in Archaea Inferred with a Phylogenetic Birth-and-Death Model
Open Access
- 30 June 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 26 (9) , 2087-2095
- https://doi.org/10.1093/molbev/msp123
Abstract
Homologous genes originate from a common ancestor through vertical inheritance, duplication, or horizontal gene transfer. Entire homolog families spawned by a single ancestral gene can be identified across multiple genomes based on protein sequence similarity. The sequences, however, do not always reveal conclusively the history of large families. To study the evolution of complete gene repertoires, we propose here a mathematical framework that does not rely on resolved gene family histories. We show that so-called phylogenetic profiles, formed by family sizes across multiple genomes, are sufficient to infer principal evolutionary trends. The main novelty in our approach is an efficient algorithm to compute the likelihood of a phylogenetic profile in a model of birth-and-death processes acting on a phylogeny. We examine known gene families in 28 archaeal genomes using a probabilistic model that involves lineage- and family-specific components of gene acquisition, duplication, and loss. The model enables us to consider all possible histories when inferring statistics about archaeal evolution. According to our reconstruction, most lineages are characterized by a net loss of gene families. Major increases in gene repertoire have occurred only a few times. Our reconstruction underlines the importance of persistent streamlining processes in shaping genome composition in Archaea. It also suggests that early archaeal genomes were as complex as typical modern ones, and even show signs, in the case of the methanogenic ancestor, of an extremely large gene repertoire.
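To make the idea of a profile likelihood concrete, the following is a minimal sketch and not the algorithm of the paper. It assumes a single set of rates shared by all branches, family sizes capped at a hypothetical `max_size`, a uniform prior at the root, and a toy three-leaf tree. All function names, rate values, and the tree encoding are illustrative assumptions. It sums over every possible history of family sizes at the internal nodes using Felsenstein-style pruning.

```python
def rate_matrix(max_size, gain, dup, loss):
    """Generator of a linear birth-and-death process with gain, truncated at max_size."""
    n_states = max_size + 1
    q = [[0.0] * n_states for _ in range(n_states)]
    for n in range(n_states):
        if n < max_size:
            q[n][n + 1] = gain + n * dup  # acquisition or duplication adds one copy
        if n > 0:
            q[n][n - 1] = n * loss        # loss removes one copy
        q[n][n] = -sum(q[n])              # diagonal makes each row sum to zero
    return q


def mat_mul(a, b):
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def transition_matrix(q, t, squarings=10, terms=12):
    """Approximate exp(Q t) by scaling and squaring with a truncated Taylor series."""
    size = len(q)
    scale = t / (2 ** squarings)
    a = [[q[i][j] * scale for j in range(size)] for i in range(size)]
    result = [[float(i == j) for j in range(size)] for i in range(size)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[x / k for x in row] for row in mat_mul(term, a)]
        result = [[result[i][j] + term[i][j] for j in range(size)]
                  for i in range(size)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


def partial_likelihood(node, profile, q):
    """Return L[n] = P(observed sizes below node | node holds n copies)."""
    n_states = len(q)
    if isinstance(node, str):  # leaf: the observed family size is known
        return [1.0 if n == profile[node] else 0.0 for n in range(n_states)]
    result = [1.0] * n_states
    for child, branch_length in node:
        p = transition_matrix(q, branch_length)
        child_l = partial_likelihood(child, profile, q)
        for n in range(n_states):
            result[n] *= sum(p[n][m] * child_l[m] for m in range(n_states))
    return result


def profile_likelihood(tree, profile, q, root_prior):
    """Likelihood of one phylogenetic profile, summed over root states."""
    l_root = partial_likelihood(tree, profile, q)
    return sum(pi * x for pi, x in zip(root_prior, l_root))


# Toy example: a leaf is a name, an internal node is a list of
# (child, branch_length) pairs. All values below are made up.
max_size = 3
q = rate_matrix(max_size, gain=0.2, dup=0.5, loss=1.0)
tree = [("A", 0.3), ([("B", 0.1), ("C", 0.2)], 0.4)]
prior = [1.0 / (max_size + 1)] * (max_size + 1)
profile = {"A": 2, "B": 1, "C": 0}
print(profile_likelihood(tree, profile, q, prior))
```

Capping the family size keeps the state space finite so that the transition probabilities fit in a small matrix; the abstract describes an efficient algorithm for the birth-and-death model itself, which this truncated sketch does not reproduce. Lineage- and family-specific rates, as mentioned in the abstract, would replace the single shared `q` with per-branch parameters.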