Improved RNA secondary structure prediction by maximizing expected pair accuracy
- 24 August 2009
- journal article
- Published by Cold Spring Harbor Laboratory in RNA
- Vol. 15 (10), 1805-1813
- https://doi.org/10.1261/rna.1643609
Abstract
Free energy minimization has been the most popular method for RNA secondary structure prediction for decades. It is based on a set of empirical free energy change parameters derived from experiments using a nearest-neighbor model. In this study, a program, MaxExpect, that predicts RNA secondary structure by maximizing the expected base-pair accuracy, is reported. This approach was pioneered in the program CONTRAfold, using pair probabilities predicted with a statistical learning method. Here, a partition function calculation that utilizes the free energy change nearest-neighbor parameters is used to predict base-pair probabilities as well as probabilities of nucleotides being single-stranded. MaxExpect predicts both the optimal structure (having the highest expected pair accuracy) and suboptimal structures to serve as alternative hypotheses for the structure. Tested on a large database of different types of RNA, the maximum expected accuracy structures are, on average, of higher accuracy than minimum free energy structures. Accuracy is measured by sensitivity, the percentage of known base pairs correctly predicted, and positive predictive value (PPV), the percentage of predicted pairs that are in the known structure. By favoring double-strandedness or single-strandedness, a higher sensitivity or PPV of prediction can be favored, respectively. Using MaxExpect, the average PPV of the optimal structure is improved from 66% to 68% at the same sensitivity level (73%) compared with free energy minimization.
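The idea in the abstract can be illustrated with a small dynamic program. The following is a minimal sketch, not the MaxExpect implementation. It assumes a score of the form 2·gamma·P_bp(i, j) summed over predicted pairs plus P_ss(k) summed over unpaired nucleotides, where the weight `gamma`, the minimum hairpin loop size `min_loop`, the function names, and the toy probabilities are all assumptions introduced here for illustration. It also restricts structures to nested (pseudoknot-free) pairs and counts only exact pair matches when computing sensitivity and PPV.

```python
from typing import Dict, List, Set, Tuple

Pair = Tuple[int, int]  # 0-based indices (i, j) with i < j


def max_expected_accuracy(
    n: int,
    pair_prob: Dict[Pair, float],
    gamma: float = 1.0,   # assumed weight: larger values favor pairing
    min_loop: int = 3,    # assumed minimum number of unpaired hairpin bases
) -> Tuple[float, Set[Pair]]:
    """Nested structure maximizing the assumed score:
    sum(2 * gamma * P_bp(i, j)) over pairs + sum(P_ss(k)) over unpaired k."""
    # Probability of being single-stranded: one minus all pairing probabilities.
    p_ss = [1.0] * n
    for (i, j), p in pair_prob.items():
        p_ss[i] -= p
        p_ss[j] -= p

    # best[i][j]: optimal score for positions i..j-1 (half-open interval).
    # choice[i][j]: partner of i in that optimum, or -1 if i is unpaired.
    best = [[0.0] * (n + 1) for _ in range(n + 1)]
    choice = [[-1] * (n + 1) for _ in range(n + 1)]
    for i in range(n - 1, -1, -1):
        for j in range(i + 1, n + 1):
            score, partner = p_ss[i] + best[i + 1][j], -1
            for k in range(i + min_loop + 1, j):
                p = pair_prob.get((i, k), 0.0)
                if p == 0.0:
                    continue
                cand = 2.0 * gamma * p + best[i + 1][k] + best[k + 1][j]
                if cand > score:
                    score, partner = cand, k
            best[i][j], choice[i][j] = score, partner

    # Traceback with an explicit stack of intervals still to resolve.
    pairs: Set[Pair] = set()
    stack: List[Pair] = [(0, n)]
    while stack:
        i, j = stack.pop()
        while i < j:
            k = choice[i][j]
            if k == -1:
                i += 1                      # position i is unpaired
            else:
                pairs.add((i, k))           # position i pairs with k
                stack.append((i + 1, k))    # region enclosed by the pair
                i = k + 1                   # continue to the right of k
    return best[0][n], pairs


def sensitivity_ppv(predicted: Set[Pair], known: Set[Pair]) -> Tuple[float, float]:
    """Sensitivity = correct / known pairs; PPV = correct / predicted pairs."""
    correct = len(predicted & known)
    sensitivity = correct / len(known) if known else 0.0
    ppv = correct / len(predicted) if predicted else 0.0
    return sensitivity, ppv


# Toy pair probabilities for an 8-nucleotide sequence (made-up values).
toy_probs: Dict[Pair, float] = {(0, 7): 0.9, (1, 6): 0.8, (2, 6): 0.1}
toy_score, toy_pairs = max_expected_accuracy(8, toy_probs, gamma=1.0)
print(sorted(toy_pairs), sensitivity_ppv(toy_pairs, {(0, 7), (1, 6), (2, 5)}))
```

In this sketch, raising `gamma` rewards pairing and so tends to raise sensitivity, while lowering it rewards single-strandedness and tends to raise PPV, which mirrors the trade-off described in the abstract. In practice the pair probabilities would come from a partition function calculation rather than a hand-written dictionary.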