Evaluation of several lightweight stochastic context-free grammars for RNA secondary structure prediction
Open Access
- 4 June 2004
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 5 (1) , 71
- https://doi.org/10.1186/1471-2105-5-71
Abstract
RNA secondary structure prediction methods based on probabilistic modeling can be developed using stochastic context-free grammars (SCFGs). Such methods can readily combine different sources of information that can be expressed probabilistically, such as an evolutionary model of comparative RNA sequence analysis and a biophysical model of structure plausibility. However, the number of free parameters in an integrated model for consensus RNA structure prediction can become untenable if the underlying SCFG design is too complex. Thus a key question is: what small, simple SCFG designs perform best for RNA secondary structure prediction? Nine different small SCFGs were implemented to explore the tradeoffs between model complexity and prediction accuracy. Each model was tested for single sequence structure prediction accuracy on a benchmark set of RNA secondary structures. Four SCFG designs had prediction accuracies near the performance of current energy minimization programs. One of these designs, introduced by Knudsen and Hein in their PFOLD algorithm, has only 21 free parameters and is significantly simpler than the others.
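To make the idea of a small SCFG concrete, the following is a minimal sketch of single-sequence structure prediction with the Knudsen-Hein grammar as it is usually written (S → LS | L, F → dFd | LS, L → s | dFd), using the CYK algorithm to find the most probable parse. It is not the implementation evaluated in the article. All probabilities below (`RULE_PROB`, `SINGLE_PROB`, `pair_prob`) are hypothetical placeholders chosen only so that each distribution sums to one; they are not trained parameters, and the function name `predict_structure` is likewise an assumption made for illustration.

```python
import math

# Hypothetical rule probabilities (placeholders, NOT trained values).
RULE_PROB = {
    "S->LS": 0.8, "S->L": 0.2,
    "L->dFd": 0.1, "L->s": 0.9,
    "F->dFd": 0.75, "F->LS": 0.25,
}
LOG_RULE = {rule: math.log(p) for rule, p in RULE_PROB.items()}

# Hypothetical emission probabilities: uniform single bases, and pair
# emissions that favor canonical pairs (6 * 0.16 + 10 * 0.004 = 1).
SINGLE_PROB = 0.25
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"),
             ("C", "G"), ("G", "U"), ("U", "G")}


def pair_prob(a, b):
    return 0.16 if (a, b) in CANONICAL else 0.004


def predict_structure(seq):
    """Return (log probability, dot-bracket string) of the best CYK parse."""
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must be non-empty")
    # Each table maps (i, j) -> (log prob, structure) for seq[i..j] inclusive.
    L, F, S = {}, {}, {}
    for span in range(1, n + 1):
        for i in range(n - span + 1):
            j = i + span - 1

            # d F d: pair positions i and j around an inner F parse.
            paired = None
            inner = F.get((i + 1, j - 1))
            if inner is not None:
                score = math.log(pair_prob(seq[i], seq[j])) + inner[0]
                paired = (score, "(" + inner[1] + ")")

            # L S: split the span into a left L and a right S.
            splits = []
            for k in range(i, j):
                left, right = L.get((i, k)), S.get((k + 1, j))
                if left is not None and right is not None:
                    splits.append((left[0] + right[0], left[1] + right[1]))
            bifurcation = max(splits) if splits else None

            # L -> s | dFd
            cands = []
            if span == 1:
                cands.append((LOG_RULE["L->s"] + math.log(SINGLE_PROB), "."))
            if paired is not None:
                cands.append((LOG_RULE["L->dFd"] + paired[0], paired[1]))
            if cands:
                L[(i, j)] = max(cands)

            # F -> dFd | LS
            cands = []
            if paired is not None:
                cands.append((LOG_RULE["F->dFd"] + paired[0], paired[1]))
            if bifurcation is not None:
                cands.append((LOG_RULE["F->LS"] + bifurcation[0], bifurcation[1]))
            if cands:
                F[(i, j)] = max(cands)

            # S -> LS | L (uses L for the same span, computed above)
            cands = []
            if bifurcation is not None:
                cands.append((LOG_RULE["S->LS"] + bifurcation[0], bifurcation[1]))
            if (i, j) in L:
                cands.append((LOG_RULE["S->L"] + L[(i, j)][0], L[(i, j)][1]))
            S[(i, j)] = max(cands)
    return S[(0, n - 1)]


if __name__ == "__main__":
    log_p, structure = predict_structure("GGGGAAAACCCC")
    print(structure, log_p)
```

Because F must derive at least two nucleotides, this grammar never produces a hairpin loop shorter than two bases, and every base pair is stacked on another pair or on a loop. The sketch runs in O(n^3) time and stores the structure strings directly in the tables for brevity; a practical implementation would keep traceback pointers instead and would estimate the probabilities from annotated RNA structures.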