Nature of the protein universe
Top Cited Papers
- 7 July 2009
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 106 (27) , 11079-11084
- https://doi.org/10.1073/pnas.0905029106
Abstract
The protein universe is the set of all proteins of all organisms. Here, all currently known sequences are analyzed in terms of families that have single-domain or multidomain architectures and whether they have a known three-dimensional structure. Growth of new single-domain families is very slow: Almost all growth comes from new multidomain architectures that are combinations of domains characterized by ≈15,000 sequence profiles. Single-domain families are mostly shared by the major groups of organisms, whereas multidomain architectures are specific and account for species diversity. There are known structures for a quarter of the single-domain families, and >70% of all sequences can be partially modeled thanks to their membership in these families.Keywords
This publication has 42 references indexed in Scilit:
- Pfam 10 years on: 10 000 families and still growingBriefings in Bioinformatics, 2008
- The Sorcerer II Global Ocean Sampling Expedition: Expanding the Universe of Protein FamiliesPLoS Biology, 2007
- Growth of novel protein structural dataProceedings of the National Academy of Sciences, 2007
- FlowerPower: clustering proteins into domain architecture classes for phylogenomic inference of protein functionBMC Ecology and Evolution, 2007
- Modeling the Evolution of Protein Domain Architectures Using Maximum ParsimonyJournal of Molecular Biology, 2006
- The structure of the protein universe and genome evolutionNature, 2002
- CDART: Protein Homology by Domain ArchitectureGenome Research, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- One thousand families for the molecular biologistNature, 1992
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990