Comparative Genomics of the Archaea (Euryarchaeota): Evolution of Conserved Protein Families, the Stable Core, and the Variable Shell
- 1 July 1999
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 9 (7), 608-628
- https://doi.org/10.1101/gr.9.7.608
Abstract
Comparative analysis of the protein sequences encoded in the four euryarchaeal species whose genomes have been sequenced completely (Methanococcus jannaschii, Methanobacterium thermoautotrophicum, Archaeoglobus fulgidus, and Pyrococcus horikoshii) revealed 1326 orthologous sets, of which 543 are represented in all four species. The proteins that belong to these conserved euryarchaeal families comprise 31%–35% of the gene complement and may be considered the evolutionarily stable core of the archaeal genomes. The core gene set includes the great majority of genes coding for proteins involved in genome replication and expression, but only a relatively small subset of metabolic functions. For many gene families that are conserved in all euryarchaea, previously undetected orthologs in bacteria and eukaryotes were identified. A number of euryarchaeal synapomorphies (unique shared characters) were identified; these are protein families that possess sequence signatures or domain architectures that are conserved in all euryarchaea but are not found in bacteria or eukaryotes. In addition, euryarchaea-specific expansions of several protein and domain families were detected. In terms of their apparent phylogenetic affinities, the archaeal protein families split into bacterial and eukaryotic families. The majority of the proteins that have only eukaryotic orthologs or show the greatest similarity to their eukaryotic counterparts belong to the core set. The families of euryarchaeal genes that are conserved in only two or three species constitute a relatively mobile component of the genomes whose evolution should have involved multiple events of lineage-specific gene loss and horizontal gene transfer. Frequently these proteins have detectable orthologs only in bacteria or show the greatest similarity to the bacterial homologs, which might suggest a significant role of horizontal gene transfer from bacteria in the evolution of the euryarchaeota.
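The core/shell distinction drawn in the abstract can be illustrated with a minimal sketch. It assumes each orthologous set is represented simply by the set of species in which it occurs (its phyletic pattern). The function name `partition_families`, the family labels, and the toy data are hypothetical and are not taken from the paper; the sketch only shows the classification rule (all four species = core, two or three species = shell), not the authors' actual procedure for detecting orthologs.

```python
# Species names as given in the abstract.
SPECIES = frozenset({
    "Methanococcus jannaschii",
    "Methanobacterium thermoautotrophicum",
    "Archaeoglobus fulgidus",
    "Pyrococcus horikoshii",
})


def partition_families(families):
    """Split orthologous sets into a stable core and a variable shell.

    families: dict mapping a family label to the set of species in which
    that family has a member (hypothetical input format).
    Returns (core, shell) as two lists of family labels.
    """
    core, shell = [], []
    for label, present_in in families.items():
        n = len(SPECIES & set(present_in))
        if n == len(SPECIES):
            core.append(label)    # represented in all four species
        elif n >= 2:
            shell.append(label)   # conserved in only two or three species
    return sorted(core), sorted(shell)


# Toy input, invented purely for illustration.
toy_families = {
    "family_A": set(SPECIES),
    "family_B": {"Methanococcus jannaschii", "Archaeoglobus fulgidus"},
    "family_C": {"Methanococcus jannaschii", "Archaeoglobus fulgidus",
                 "Pyrococcus horikoshii"},
}

core, shell = partition_families(toy_families)
```

With the toy input, `family_A` falls in the core and the other two families fall in the shell.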