Underlying order in protein sequence organization.
- 26 April 1994
- journal article
- Published in Proceedings of the National Academy of Sciences
- Vol. 91 (9) , 4044-4047
- https://doi.org/10.1073/pnas.91.9.4044
Abstract
The idea of a possible standard modular structure of proteins has been known since 1929 when it was introduced by Svedberg. It still remains an idea with no quantitative confirmation of universality of such hypothetical organization. From a large collection of nonredundant protein sequences representing >100 eukaryotic and prokaryotic species, we have obtained the protein sequence length distributions. Mere inspection of these distributions, as well as spectral analysis, shows that 15-30% of proteins, depending on species and sequence types, indeed appear to be made of sequence units with characteristic lengths of approximately 125 aa for eukaryotes and approximately 150 aa for prokaryotes. This underlying order in protein sequence organization is shown to be universal; that is, the weak regularity observed is not caused by a particular dominant species or protein group. Possible mechanisms are discussed that may be responsible for the observed regularity, including a hypothesis about the recombinational nature of such protein sequence organization.
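The abstract does not describe the spectral analysis in detail, so the following is only a minimal sketch of one way such a check could be set up, not the authors' procedure. It builds a histogram of sequence lengths, subtracts the mean, and evaluates the Fourier power at a range of candidate periods (in aa). The function names, the candidate range of 80 to 200 aa, and the synthetic test lengths are all assumptions made for illustration; real length distributions would also need a smoother detrending step than a simple mean subtraction.

```python
import cmath
import math
from collections import Counter


def length_histogram(lengths):
    """Count how many sequences have each length (list index = length in aa)."""
    counts = Counter(lengths)
    return [counts.get(n, 0) for n in range(max(lengths) + 1)]


def power_at_period(hist, period):
    """Fourier power of the mean-subtracted histogram at one period (in aa)."""
    mean = sum(hist) / len(hist)
    total = sum(
        (h - mean) * cmath.exp(-2j * math.pi * n / period)
        for n, h in enumerate(hist)
    )
    return abs(total) ** 2


def dominant_period(lengths, candidate_periods):
    """Return the candidate period with the highest spectral power."""
    hist = length_histogram(lengths)
    return max(candidate_periods, key=lambda p: power_at_period(hist, p))


def synthetic_lengths(unit=125, copies=6, jitter=5, per_length=20):
    """Hypothetical data: lengths clustered around multiples of `unit`.

    This is illustrative input only, not data from the paper.
    """
    return [
        unit * k + offset
        for k in range(1, copies + 1)
        for offset in range(-jitter, jitter + 1)
        for _ in range(per_length)
    ]


if __name__ == "__main__":
    # Scan candidate unit lengths between 80 and 200 aa (an assumed range).
    print(dominant_period(synthetic_lengths(), range(80, 201)))
```

Evaluating the power directly at integer periods, rather than at the frequency bins of a fast Fourier transform, keeps the result in the same units as the characteristic lengths quoted in the abstract, at the cost of being slower for large inputs.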