Predicting RNA-binding sites from the protein structure based on electrostatics, evolution and geometry
Open Access
- 14 February 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (5) , e29
- https://doi.org/10.1093/nar/gkn008
Abstract
An RNA-binding protein places a surface helix, β-ribbon, or loop in an RNA helix groove and/or uses a cavity to accommodate unstacked bases. Hence, our strategy for predicting RNA-binding residues is based on detecting a surface patch and a disparate cleft. These were generated and scored according to the gas-phase electrostatic energy change upon mutating each residue to Asp − /Glu − and each residue's relative conservation. The method requires as input the protein structure and sufficient homologous sequences to define each residue's relative conservation. It yields as output a priority list of surface patch residues followed by a backup list of surface cleft residues distant from the patch residues for experimental testing of RNA binding. Among the 69 structurally non-homologous proteins tested, 81% possess a RNA-binding site with at least 70% of the maximum number of true positives in randomly generated patches of the same size as the predicted site; only two proteins did not contain any true RNA-binding residues in both predicted regions. Regardless of the protein conformational changes upon RNA-binding, the prediction accuracies based on the RNA-free/bound protein structures were found to be comparable and their binding sites overlapped as long as there are no disordered RNA-binding regions in the free structure that are ordered in the corresponding RNA-bound protein structure.Keywords
This publication has 34 references indexed in Scilit:
- Amino acid residue doublet propensity in the protein–RNA interface and its application to RNA interface predictionNucleic Acids Research, 2006
- BindN: a web-based tool for efficient prediction of DNA and RNA binding sites in amino acid sequencesNucleic Acids Research, 2006
- Prediction of RNA binding sites in proteins from amino acid sequenceRNA, 2006
- Predicting rRNA-, RNA-, and DNA-binding proteins from primary structure with support vector machinesPublished by Elsevier ,2005
- A point‐charge force field for molecular mechanics simulations of proteins based on condensed‐phase quantum mechanical calculationsJournal of Computational Chemistry, 2003
- A graph‐theory algorithm for rapid protein side‐chain predictionProtein Science, 2003
- Statistical analysis of atomic contacts at RNA–protein interfacesJournal of Molecular Recognition, 2001
- The Protein Data BankNucleic Acids Research, 2000
- RNA–protein complexesCurrent Opinion in Structural Biology, 1999
- Prediction of protein-protein interaction sites using patch analysis 1 1Edited by G. von HeijneJournal of Molecular Biology, 1997