Predicting helix–helix interactions from residue contacts in membrane proteins
Open Access
- 25 February 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (8) , 996-1003
- https://doi.org/10.1093/bioinformatics/btp114
Abstract
Motivation: Helix–helix interactions play a critical role in the structure assembly, stability and function of membrane proteins. On the molecular level, the interactions are mediated by one or more residue contacts. Although previous studies focused on helix-packing patterns and sequence motifs, few of them developed methods specifically for contact prediction.Results: We present a new hierarchical framework for contact prediction, with an application in membrane proteins. The hierarchical scheme consists of two levels: in the first level, contact residues are predicted from the sequence and their pairing relationships are further predicted in the second level. Statistical analyses on contact propensities are combined with other sequence and structural information for training the support vector machine classifiers. Evaluated on 52 protein chains using leave-one-out cross validation (LOOCV) and an independent test set of 14 protein chains, the two-level approach consistently improves the conventional direct approach in prediction accuracy, with 80% reduction of input for prediction. Furthermore, the predicted contacts are then used to infer interactions between pairs of helices. When at least three predicted contacts are required for an inferred interaction, the accuracy, sensitivity and specificity are 56%, 40% and 89%, respectively. Our results demonstrate that a hierarchical framework can be applied to eliminate false positives (FP) while reducing computational complexity in predicting contacts. Together with the estimated contact propensities, this method can be used to gain insights into helix-packing in membrane proteins.Availability: http://bio-cluster.iis.sinica.edu.tw/TMhit/Contact: tsung@iis.sinica.edu.twSupplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 45 references indexed in Scilit:
- Predicting protein folding rates from geometric contact and amino acid sequenceProtein Science, 2008
- Using inferred residue contacts to distinguish between correct and incorrect protein modelsBioinformatics, 2008
- Prediction of membrane-protein topology from first principlesProceedings of the National Academy of Sciences, 2008
- Co-evolving residues in membrane proteinsBioinformatics, 2007
- TOPDB: topology data bank of transmembrane proteinsNucleic Acids Research, 2007
- Prediction of the burial status of transmembrane residues of helical membrane proteinsBMC Bioinformatics, 2007
- Helix-packing motifs in membrane proteinsProceedings of the National Academy of Sciences, 2006
- Mapping pathways of allosteric communication in GroEL by analysis of correlated mutationsProteins-Structure Function and Bioinformatics, 2002
- The Protein Data BankNucleic Acids Research, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997