An Evolutionarily Structured Universe of Protein Architecture
- 1 July 2003
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 13 (7) , 1563-1571
- https://doi.org/10.1101/gr.1161903
Abstract
Protein structural diversity encompasses a finite set of architectural designs. Embedded in these topologies are evolutionary histories that we here uncover using cladistic principles and measurements of protein-fold usage and sharing. The reconstructed phylogenies are inherently rooted and depict histories of protein and proteome diversification. Proteome phylogenies showed two monophyletic sister-groups delimiting Bacteria and Archaea, and a topology rooted in Eucarya. This suggests three dramatic evolutionary events and a common ancestor with a eukaryotic-like, gene-rich, and relatively modern organization. Conversely, a general phylogeny of protein architectures showed that structural classes of globular proteins appeared early in evolution and in defined order, the α/β class being the first. Although most ancestral folds shared a common architecture of barrels or interleaved β-sheets and α-helices, many were clearly derived, such as polyhedral folds in the all-α class and β-sandwiches, β-propellers, and β-prisms in all-β proteins. We also describe transformation pathways of architectures that are prevalently used in nature. For example, β-barrels with increased curl and stagger were favored evolutionary outcomes in the all-β class. Interestingly, we found cases where structural change followed the α-to-β tendency uncovered in the tree of architectures. Lastly, we traced the total number of enzymatic functions associated with folds in the trees and show that there is a general link between structure and enzymatic function.Keywords
This publication has 55 references indexed in Scilit:
- Monophyly of class I aminoacyl tRNA synthetase, USPA, ETFP, photolyase, and PP‐ATPase nucleotide‐binding domains: implications for protein evolution in the RNA worldProteins-Structure Function and Bioinformatics, 2002
- Evidence Suggesting That a Fifth of Annotated Caenorhabditis elegans Genes May Be PseudogenesGenome Research, 2002
- Protein family and fold occurrence in genomes: power-law behaviour and evolutionary modelJournal of Molecular Biology, 2001
- PROTEIN FOLDS IN THE ALL-β AND ALL-α CLASSESAnnual Review of Biophysics, 1997
- Protein evolution viewed through Escherichia coli Protein sequences: Introducing the notion of a structural segment of homology, the moduleJournal of Molecular Biology, 1997
- Global Statistics of Protein Sequences: Implications for the Origin, Evolution, and Prediction of StructureAnnual Review of Biophysics, 1994
- The Holy Grail of the Perfect Character: the Cladistic Treatment of Morphometric DataCladistics, 1993
- Confidence Limits on Phylogenies: An Approach Using the BootstrapEvolution, 1985
- Prokaryotes and eukaryotes: strategies and successesTrends in Biochemical Sciences, 1982
- Gene duplications in the structural evolution of chymotrypsinJournal of Molecular Biology, 1979