Solving and analyzing side-chain positioning problems using linear and integer programming
Open Access
- 16 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (7) , 1028-1039
- https://doi.org/10.1093/bioinformatics/bti144
Abstract
Motivation: Side-chain positioning is a central component of homology modeling and protein design. In a common formulation of the problem, the backbone is fixed, side-chain conformations come from a rotamer library, and a pairwise energy function is optimized. It is NP-complete to find even a reasonable approximate solution to this problem. We seek to put this hardness result into practical context. Results: We present an integer linear programming (ILP) formulation of side-chain positioning that allows us to tackle large problem sizes. We relax the integrality constraint to give a polynomial-time linear programming (LP) heuristic. We apply LP to position side chains on native and homologous backbones and to choose side chains for protein design. Surprisingly, when positioning side chains on native and homologous backbones, optimal solutions using a simple, biologically relevant energy function can usually be found using LP. On the other hand, the design problem often cannot be solved using LP directly; however, optimal solutions for large instances can still be found using the computationally more expensive ILP procedure. While different energy functions also affect the difficulty of the problem, the LP/ILP approach is able to find optimal solutions. Our analysis is the first large-scale demonstration that LP-based approaches are highly effective in finding optimal (and successive near-optimal) solutions for the side-chain positioning problem. Availability: The source code for generating the ILP given a file of pairwise energies between rotamers is available online at http://compbio.cs.princeton.edu/scplp Contact:msingh@cs.princeton.eduKeywords
This publication has 36 references indexed in Scilit:
- Tertiary templates for proteins: Use of packing criteria in the enumeration of allowed sequences for different structural classesPublished by Elsevier ,2005
- A graph‐theory algorithm for rapid protein side‐chain predictionProtein Science, 2003
- Extending the accuracy limits of prediction for side-chain conformationsJournal of Molecular Biology, 2001
- Extending the accuracy limits of prediction for side-chain conformationsJournal of Molecular Biology, 2001
- Generalized dead-end elimination algorithms make large-scale protein side-chain structure prediction tractable: implications for protein design and structural genomicsJournal of Molecular Biology, 2001
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994
- Backbone-dependent Rotamer Library for Proteins Application to Side-chain PredictionJournal of Molecular Biology, 1993
- Database algorithm for generating protein backbone and side-chain co-ordinates from a Cα trace: Application to model building and detection of co-ordinate errorsJournal of Molecular Biology, 1991
- Prediction of protein side-chain conformation by packing optimizationJournal of Molecular Biology, 1991
- Construction of side-chains in homology modellingJournal of Molecular Biology, 1989