Understanding Protein Flexibility through Dimensionality Reduction
- 1 June 2003
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 10 (3-4) , 617-634
- https://doi.org/10.1089/10665270360688228
Abstract
This work shows how to decrease the complexity of modeling flexibility in proteins by reducing the number of dimensions necessary to model important macromolecular motions such as the induced-fit process. Induced fit occurs during the binding of a protein to other proteins, nucleic acids, or small molecules (ligands) and is a critical part of protein function. It is now widely accepted that conformational changes of proteins can affect their ability to bind other molecules and that any progress in modeling protein motion and flexibility will contribute to the understanding of key biological functions. However, modeling protein flexibility has proven a very difficult task. Experimental laboratory methods, such as x-ray crystallography, produce rather limited information, while computational methods such as molecular dynamics are too slow for routine use with large systems. In this work, we show how to use the principal component analysis method, a dimensionality reduction technique, to transform the original high-dimensional representation of protein motion into a lower dimensional representation that captures the dominant modes of motions of proteins. For a medium-sized protein, this corresponds to reducing a problem with a few thousand degrees of freedom to one with less than fifty. Although there is inevitably some loss in accuracy, we show that we can obtain conformations that have been observed in laboratory experiments, starting from different initial conformations and working in a drastically reduced search space.Keywords
This publication has 47 references indexed in Scilit:
- The Protein Data BankNucleic Acids Research, 2000
- Harmonic modes as variables to approximately account for receptor flexibility in ligand-receptor docking simulations: Application to DNA minor groove ligand complexJournal of Computational Chemistry, 1999
- All-Atom Empirical Potential for Molecular Modeling and Dynamics Studies of ProteinsThe Journal of Physical Chemistry B, 1998
- Essential dynamics of proteinsProteins-Structure Function and Bioinformatics, 1993
- Principal curves revisitedStatistics and Computing, 1992
- Efficient computation of three-dimensional protein structures in solution from nuclear magnetic resonance data using the program DIANA and the supporting programs CALIBA, HABAS and GLOMSAJournal of Molecular Biology, 1991
- Protein normal-mode dynamics: Trypsin inhibitor, crambin, ribonuclease and lysozymeJournal of Molecular Biology, 1985
- Molecular dynamics with coupling to an external bathThe Journal of Chemical Physics, 1984
- Nonmetric Multidimensional Scaling: A Numerical MethodPsychometrika, 1964
- Analysis of a complex of statistical variables into principal components.Journal of Educational Psychology, 1933