Constructing amino acid residue substitution classes maximally indicative of local protein structure
- 1 May 1996
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 25 (1) , 28-37
- https://doi.org/10.1002/(sici)1097-0134(199605)25:1<28::aid-prot3>3.0.co;2-g
Abstract
Using an information theoretic formalism, we optimize classes of amino acid substitution to be maximally indicative of local protein structure. Our statistically‐derived classes are loosely identifiable with the heuristic constructions found in previously published work. However, while these other methods provide a more rigid idealization of physicochemically constrained residue substitution, our classes provide substantially more structural information with many fewer parameters. Moreover, these substitution classes are consistent with the paradigmatic view of the sequence‐to‐structure relationship in globular proteins which holds that the three‐dimensional architecture is predominantly determined by the arrangement of hydrophobic and polar side chains with weak constraints on the actual amino acid identities. More specific constraints are imposed on the placement of prolines, glycines, and the charged residues. These substitution classes have been used in highly accurate predictions of residue solvent accessibility. They could also be used in the identification of homologous proteins, the construction and refinement of multiple sequence alignments, and as a means of condensing and codifying the information in multiple sequence alignments for secondary structure prediction and tertiary fold recognition.Keywords
This publication has 61 references indexed in Scilit:
- Environment and exposure to solvent of protein atoms. Lysozyme and insulinPublished by Elsevier ,2004
- Prediction of Protein Secondary Structure by Combining Nearest-neighbor Algorithms and Multiple Sequence AlignmentsJournal of Molecular Biology, 1995
- Enlarged representative set of protein structuresProtein Science, 1994
- Bona Fide Prediction of Aspects of Protein Conformation: Assigning Interior and Surface Residues from Patterns of Variation and Conservation in Homologous Protein SequencesJournal of Molecular Biology, 1994
- Features of spliceosome evolution and function inferred from an analysis of the information at human splice sitesJournal of Molecular Biology, 1992
- Prediction of the occurrence of the ADP-binding βαβ-fold in proteins, using an amino acid sequence fingerprintJournal of Molecular Biology, 1986
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977