The structure of the protein universe and genome evolution
- 14 November 2002
- journal article
- review article
- Published by Springer Nature in Nature
- Vol. 420 (6912), 218-223
- https://doi.org/10.1038/nature01256
Abstract
Despite the practically unlimited number of possible protein sequences, the number of basic shapes in which proteins fold seems not only to be finite, but also to be relatively small, with probably no more than 10,000 folds in existence. Moreover, the distribution of proteins among these folds is highly non-homogeneous: some folds and superfamilies are extremely abundant, but most are rare. Protein folds and families encoded in diverse genomes show similar size distributions with notable mathematical properties, which also extend to the number of connections between domains in multidomain proteins. All these distributions follow asymptotic power laws, such as have been identified in a wide variety of biological and physical systems, and which are typically associated with scale-free networks. These findings suggest that genome evolution is driven by extremely general mechanisms based on the preferential attachment principle.
This publication has 76 references indexed in Scilit:
- Statistical mechanics of complex networks. Reviews of Modern Physics, 2002
- Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. Journal of Molecular Biology, 2001
- Regulatory potential, phyletic distribution and evolution of ancient, intracellular small-molecule-binding domains. Journal of Molecular Biology, 2001
- Initial sequencing and analysis of the human genome. Nature, 2001
- Estimating the number of protein folds and families from complete genome data. Journal of Molecular Biology, 2000
- Who's your neighbor? New computational approaches for functional genomics. Nature Biotechnology, 2000
- Estimating the number of protein folds. Journal of Molecular Biology, 1998
- CATH – a hierarchic classification of protein domain structures. Published by Elsevier, 1997
- One thousand families for the molecular biologist. Nature, 1992
- The appearance of new structures and functions in proteins during evolution. Journal of Molecular Evolution, 1975