How are close residues of protein structures distributed in primary sequence?
- 19 December 1995
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 92 (26) , 12136-12140
- https://doi.org/10.1073/pnas.92.26.12136
Abstract
Structurally neighboring residues are categorized according to their separation in the primary sequence as proximal (1-4 positions apart) and otherwise distal, which in turn is divided into near (5-20 positions), far (21-50 positions), very far (>50 positions), and interchain (from different chains of the same structure). These categories describe the linear distance histogram (LDH) for three-dimensional neighboring residue types. Among the main results are the following: (i) Nearest-neighbor hydrophobic residues tend to be increasingly distally separated in the linear sequence, thus most often connecting distinct secondary structure units. (ii) The LDHs of oppositely charged nearest-neighbors emphasize proximal positions with a subsidiary maximum for very far positions. (iii) Cysteine-cysteine structural interactions rarely involve proximal positions. (iv) The greatest numbers of interchain specific nearest-neighbors in protein structures are composed of oppositely charged residues. (v) The largest fraction of side-chain neighboring residues from beta-strands involves near positions, emphasizing associations between consecutive strands. (vi) Exposed residue pairs are predominantly located in proximal linear positions, while buried residue pairs principally correspond to far or very far distal positions. The results are principally invariant to protein sizes, amino acid usages, linear distance normalizations, and over- and underrepresentations among nearest-neighbor types. Interpretations and hypotheses concerning the LDHs, particularly those of hydrophobic and charged pairings, are discussed with respect to protein stability and functionality. The pronounced occurrence of oppositely charged interchain contacts is consistent with many observations on protein complexes where multichain stabilization is facilitated by electrostatic interactions.
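The separation categories defined in the abstract can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `ldh_category` and `linear_distance_histogram`, the `(chain, position)` pair representation, and the choice to report fractions rather than raw counts are all assumptions made for illustration. Only the category boundaries (1-4, 5-20, 21-50, >50, interchain) come from the abstract. How structural neighbors are identified in the first place is not specified here and is left to the caller.

```python
from collections import Counter

# Category names taken from the abstract; the ordering is only for display.
CATEGORIES = ("proximal", "near", "far", "very far", "interchain")


def ldh_category(chain_a, pos_a, chain_b, pos_b):
    """Classify one pair of structurally neighboring residues by sequence separation.

    Hypothetical helper: chains are identified by any hashable label and
    positions are integer sequence indices within a chain.
    """
    if chain_a != chain_b:
        return "interchain"
    sep = abs(pos_a - pos_b)
    if sep == 0:
        raise ValueError("a residue cannot be paired with itself")
    if sep <= 4:
        return "proximal"   # 1-4 positions apart
    if sep <= 20:
        return "near"       # 5-20 positions apart
    if sep <= 50:
        return "far"        # 21-50 positions apart
    return "very far"       # more than 50 positions apart


def linear_distance_histogram(pairs):
    """Return the fraction of neighbor pairs falling in each category.

    `pairs` is an iterable of (chain_a, pos_a, chain_b, pos_b) tuples
    (an assumed input format, not one given in the source).
    """
    counts = Counter(ldh_category(*pair) for pair in pairs)
    total = sum(counts.values())
    if total == 0:
        return {category: 0.0 for category in CATEGORIES}
    return {category: counts[category] / total for category in CATEGORIES}
```

For example, `ldh_category("A", 10, "A", 12)` falls in the proximal category, while two residues on chains "A" and "B" are interchain regardless of their positions. Restricting `pairs` to one residue-type pairing (say, hydrophobic-hydrophobic) would give a histogram for that pairing alone.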