On the use of sequence homologies to predict protein structure: identical pentapeptides can have completely different conformations.
- 1 February 1984
- journal article
- research article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 81 (4) , 1075-1078
- https://doi.org/10.1073/pnas.81.4.1075
Abstract
The search for amino acid sequence homologies can be a powerful tool for predicting protein structure. Discovered sequence homologies are currently used in predicting the function of oncogene proteins. To sharpen this tool, the structural significance of short sequence homologies was investigated by searching proteins of known 3-dimensional structure for subsequence identities. In 62 proteins with 10,000 residues, the longest isolated homologies between unrelated proteins are 5 residues long. In 6 (out of 25) cases there was surprising structural adaptability: the same 5 residues are part of an .alpha.-helix in one protein and part of a .beta.-strand in another protein. These examples show quantitatively that pentapeptide structure within a protein is strongly dependent on sequence context, a fact essentially ignored in most protein structure prediction methods: just considering the local sequence of 5 residues is not sufficient to predict correctly the local conformation (secondary structure). Cooperativity of length 6 or longer must be taken into account. Also, in the growing practice of comparing a new protein sequence with a data base of known sequences, finding an identical pentapeptide sequence between 2 proteins is not a significant indication of structural similarity or of evolutionary kinship.Keywords
This publication has 13 references indexed in Scilit:
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Simian Sarcoma Virus onc Gene, v- sis , Is Derived from the Gene (or Genes) Encoding a Platelet-Derived Growth FactorScience, 1983
- Platelet-derived growth factor is structurally related to the putative transforming protein p28sis of simian sarcoma virusNature, 1983
- How good are predictions of protein secondary structure?FEBS Letters, 1983
- Prediction of super-secondary structure in proteinsNature, 1983
- Theory of protein secondary structure and algorithm of its predictionBiopolymers, 1983
- Similar Amino Acid Sequences: Chance or Common Ancestry?Science, 1981
- Structure of erythrocruorin in different ligand states refined at 1·4 Å resolutionJournal of Molecular Biology, 1979
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- An evaluation of the relatedness of proteins based on comparison of amino acid sequencesJournal of Molecular Biology, 1970