Wiggle—Predicting Functionally Flexible Regions from Primary Sequence
Open Access
- 14 July 2006
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 2 (7) , e90
- https://doi.org/10.1371/journal.pcbi.0020090
Abstract
The Wiggle series are support vector machine–based predictors that identify regions of functional flexibility using only protein sequence information. Functionally flexible regions are defined as regions that can adopt different conformational states and are assumed to be necessary for bioactivity. Many advances have been made in understanding the relationship between protein sequence and structure. This work contributes to those efforts by making strides to understand the relationship between protein sequence and flexibility. A coarse-grained protein dynamic modeling approach was used to generate the dataset required for support vector machine training. We define our regions of interest based on the participation of residues in correlated large-scale fluctuations. Even with this structure-based approach to computationally define regions of functional flexibility, predictors successfully extract sequence-flexibility relationships that have been experimentally confirmed to be functionally important. Thus, a sequence-based tool to identify flexible regions important for protein function has been created. The ability to identify functional flexibility using a sequence based approach complements structure-based definitions and will be especially useful for the large majority of proteins with unknown structures. The methodology offers promise to identify structural genomics targets amenable to crystallization and the possibility to engineer more flexible or rigid regions within proteins to modify their bioactivity. Proteins are not static entities in biology and are constantly changing their shape and form to perform their necessary biological roles. While we are intuitively aware of their constantly changing nature, we have little understanding of how their flexibility is encoded in the protein sequence. To address this knowledge gap, predictors were created to identify sequence patterns that dictate local regions to be flexible and serve a functional purpose. By combining protein dynamic modeling and machine learning techniques, the Wiggle predictor series were able to generalize the sequence-flexibility relationship for all proteins. With these predictors we are able to identify flexible regions of functional importance such as hinges, recognition loops, and catalytic loops using only sequence information. This work has important contributions to our understanding of the sequence-flexibility relationship and paves the road to identifying local sequence modulations that impact protein function without necessarily changing the structure.Keywords
This publication has 82 references indexed in Scilit:
- Coupled Folding and Binding with α-Helix-Forming Molecular Recognition ElementsBiochemistry, 2005
- Progress of Structural Genomics Initiatives: An Analysis of Solved Target StructuresJournal of Molecular Biology, 2005
- The Pairwise Energy Content Estimated from Amino Acid Composition Discriminates between Folded and Intrinsically Unstructured ProteinsJournal of Molecular Biology, 2005
- Escherichia coli adenylate kinase dynamics: Comparison of elastic network model modes with mode‐coupling 15N‐NMR relaxation dataProteins-Structure Function and Bioinformatics, 2004
- The 1.0 Å crystal structure of Ca2+-bound calmodulin: an analysis of disorder and implications for functionally relevant plasticityJournal of Molecular Biology, 2000
- The Protein Data BankNucleic Acids Research, 2000
- Asp34 of PvuII endonuclease is directly involved in DNA minor groove recognition and indirectly involved in catalysisJournal of Molecular Biology, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Enhanced protein flexibility caused by a destabilizing amino acid replacement in BPTIJournal of Molecular Biology, 1997
- Dissociation of a native dimer to a molten globule monomer: Effects of pressure and dilution on the association equilibrium of arc repressorJournal of Molecular Biology, 1992