Efficient parameter estimation for RNA secondary structure prediction
Open Access
- 1 July 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (13), i19-i28
- https://doi.org/10.1093/bioinformatics/btm223
Abstract
Motivation: Accurate prediction of RNA secondary structure from the base sequence is an unsolved computational challenge. The accuracy of predictions made by free energy minimization is limited by the quality of the energy parameters in the underlying free energy model. The most widely used model, the Turner99 model, has hundreds of parameters, so a robust parameter estimation scheme should efficiently handle large data sets with thousands of structures. Moreover, the estimation scheme should be trained on available experimental free energy data in addition to structural data.

Results: In this work, we present constraint generation (CG), the first computational approach to RNA free energy parameter estimation that can be efficiently trained on large sets of structural as well as thermodynamic data. Our CG approach employs a novel iterative scheme, whereby the energy values are first computed as the solution to a constrained optimization problem. The newly computed energy parameters are then used to update the constraints of the optimization problem, so as to better optimize the energy parameters in the next iteration. Using our method on biologically sound data, we obtain revised parameters for the Turner99 energy model. We show that by using our new parameters, we obtain significant improvements in prediction accuracy over current state-of-the-art methods.

Availability: Our CG implementation is available at http://www.rnasoft.ca/CG/

Contact: andrones@cs.ubc.ca
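The iterative scheme described under Results can be illustrated with a toy sketch. This is not the authors' implementation, and every name in it is hypothetical. It assumes a linear energy model (energy = parameters times feature counts), replaces minimum free energy folding with a search over a small hand-written candidate set, and stands in for the constrained optimizer with a plain regularized hinge-loss subgradient descent. Constraints from thermodynamic data are omitted.

```python
# Toy constraint generation loop. All data and the simple solver are
# illustrative assumptions, not the method from the paper.

def energy(theta, feats):
    """Linear energy model: dot product of parameters and feature counts."""
    return sum(t * f for t, f in zip(theta, feats))

def predict(theta, candidates):
    """Name of the candidate structure with minimum energy under theta."""
    return min(candidates, key=lambda name: energy(theta, candidates[name]))

def solve(theta0, constraints, margin=1.0, reg=0.1, lr=0.05, steps=2000):
    """Approximately minimise reg*||theta - theta0||^2 plus hinge penalties.

    Each constraint (known, wrong) asks for E(known) + margin <= E(wrong).
    Subgradient descent is a stand-in for a real constrained optimizer.
    """
    theta = list(theta0)
    for _ in range(steps):
        grad = [2 * reg * (t - t0) for t, t0 in zip(theta, theta0)]
        for known, wrong in constraints:
            if energy(theta, known) + margin > energy(theta, wrong):
                grad = [g + k - w for g, k, w in zip(grad, known, wrong)]
        theta = [t - lr * g for t, g in zip(theta, grad)]
    return theta

def constraint_generation(theta0, training_set, max_iters=20):
    """Alternate between predicting structures and re-solving for theta."""
    theta, constraints = list(theta0), []
    for _ in range(max_iters):
        new = []
        for known_name, candidates in training_set:
            pred = predict(theta, candidates)
            if pred != known_name:
                c = (candidates[known_name], candidates[pred])
                if c not in constraints and c not in new:
                    new.append(c)
        if not new:  # every known structure is already the minimum
            break
        constraints.extend(new)
        theta = solve(theta0, constraints)
    return theta, constraints

# Hypothetical data: two sequences, features = [stacking count, loop count].
theta0 = [-0.5, 2.0]
training_set = [
    ("known", {"known": [3, 1], "alt": [1, 0]}),  # mispredicted under theta0
    ("known", {"known": [2, 0], "alt": [0, 1]}),  # already correct
]
theta, constraints = constraint_generation(theta0, training_set)
predictions = [predict(theta, cands) for _, cands in training_set]
```

In this toy run the first sequence is mispredicted under the initial parameters, so one constraint is generated and the parameters are re-solved. The loop stops once no known structure is beaten by a competing candidate.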