PROTEIN FAMILIES AND THEIR EVOLUTION—A STRUCTURAL PERSPECTIVE
- 1 June 2005
- journal article
- review article
- Published by Annual Reviews in Annual Review of Biochemistry
- Vol. 74 (1) , 867-900
- https://doi.org/10.1146/annurev.biochem.74.082803.133029
Abstract
▪ Abstract We can now assign about two thirds of the sequences from completed genomes to as few as 1400 domain families for which structures are known and thus more ancient evolutionary relationships established. About 200 of these domain families are common to all kingdoms of life and account for nearly 50% of domain structure annotations in the genomes. Some of these domain families have been very extensively duplicated within a genome and combined with different domain partners giving rise to different multidomain proteins. The ways in which these domain combinations evolve tend to be specific to the organism so that less than 15% of the protein families found within a genome appear to be common to all kingdoms of life. Recent analyses of completed genomes, exploiting the structural data, have revealed the extent to which duplication of these domains and modifications of their functions can expand the functional repertoire of the organism, contributing to increasing complexity.Keywords
This publication has 95 references indexed in Scilit:
- Gene regulatory network growth by duplicationNature Genetics, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- Comparative genomics, minimal gene-sets and the last universal common ancestorNature Reviews Microbiology, 2003
- The Protein Data BankNucleic Acids Research, 2000
- Domain assignment for protein structures using a consensus approach: Characterization and analysisProtein Science, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- CATH – a hierarchic classification of protein domain structuresPublished by Elsevier ,1997
- Threading a database of protein coresProteins-Structure Function and Bioinformatics, 1995
- A new approach to protein fold recognitionNature, 1992
- One thousand families for the molecular biologistNature, 1992