Using a measure of structural variation to define a core for the globins
- 1 December 1995
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 11 (6) , 633-644
- https://doi.org/10.1093/bioinformatics/11.6.633
Abstract
As the database of three-dimensional protein structures expands, it becomes possible to classify related structures into families. Some of these families, such as the globins, have enough members to allow statistical analysis of conserved features. Previously, we have shown that a probabilistic representation based on means and variances can be useful for defining structural cores for large families. These cores contain the subset of atoms that are in essentially the same relative positions in all members of the family. In addition to defining a core, our method creates an ordered list of atoms, ranked by their structural variation. In applying our core-finding procedure to the globins, we find that helices A, B, G and H form a structural core with low variance. These helices fold early in the folding pathway, and superimpose well with helices in the helix-turn-helix repressor protein family. The non-core helices (F and the parts of other helices that interact with it) are associated with the functional differences among the globins, and are encoded within a separate exon. We have also compared the variablity measure implicit in our core structures with measures of sequence variability, using a procedure for measuring sequence variability that helps correct for the biased sampling in the databanks. We find, somewhat surprisingly, that sequence variation does not appear to correlate with structural variation.Keywords
This publication has 12 references indexed in Scilit:
- Tertiary templates for proteins: Use of packing criteria in the enumeration of allowed sequences for different structural classesPublished by Elsevier ,2005
- Average Core Structures and Variability Measures for Protein Families: Application to the ImmunoglobulinsJournal of Molecular Biology, 1995
- Methods for displaying macromolecular structural uncertainty: Application to the globinsJournal of Molecular Graphics, 1995
- Volume changes in protein evolutionJournal of Molecular Biology, 1994
- Identification and classification of protein fold familiesProtein Engineering, Design and Selection, 1993
- A data bank merging related protein structures and sequencesProtein Engineering, Design and Selection, 1992
- Database of homology‐derived protein structures and the structural meaning of sequence alignmentProteins-Structure Function and Bioinformatics, 1991
- THE CLASSIFICATION AND ORIGINS OF PROTEIN FOLDING PATTERNSAnnual Review of Biochemistry, 1990
- General architecture of the α-helical globuleJournal of Molecular Biology, 1988
- Determinants of a protein foldJournal of Molecular Biology, 1987