Consensus Folding of Unaligned RNA Sequences Revisited
- 1 March 2006
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 13 (2), 283-295
- https://doi.org/10.1089/cmb.2006.13.283
Abstract
As one of the earliest problems in computational biology, the RNA secondary structure prediction problem (sometimes referred to as "RNA folding") has attracted renewed attention, thanks to the recent discoveries of many novel non-coding RNA molecules. The two common approaches to this problem are de novo prediction of RNA secondary structure based on energy minimization and the consensus folding approach (computing the common secondary structure for a set of unaligned RNA sequences). Consensus folding algorithms work well when the correct seed alignment is part of the input to the problem. However, seed alignment is itself a challenging problem for diverged RNA families. In this paper, we propose a novel framework to predict the common secondary structure for unaligned RNA sequences. By matching putative stacks in RNA sequences, we make use of both primary sequence information and thermodynamic stability for prediction at the same time. We show that our method can predict the correct common RNA secondary structures even when we are given only a limited number of unaligned RNA sequences, and that it outperforms current algorithms in sensitivity and accuracy.
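To make the notion of a "putative stack" concrete, the following is a minimal Python sketch that enumerates maximal runs of consecutive complementary base pairs in a single RNA sequence. It is an illustration under stated assumptions, not the method of the paper: the function name `putative_stacks`, the parameters `min_len` and `min_loop`, and the choice to allow G-U wobble pairs are all hypothetical, and the sketch neither scores thermodynamic stability nor matches stacks across sequences.

```python
# Illustrative sketch only; not the algorithm described in the paper.
# Canonical Watson-Crick pairs plus G-U wobble pairs (an assumption).
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def putative_stacks(seq, min_len=3, min_loop=3):
    """Return maximal stacks as (i, j, length) tuples, 0-based.

    A stack pairs seq[i + k] with seq[j - k] for k = 0 .. length - 1.
    min_len is the shortest stack reported; min_loop is the minimum
    number of unpaired bases enclosed by the innermost pair.
    """
    n = len(seq)
    stacks = []
    for i in range(n):
        for j in range(i + min_loop + 1, n):
            # Skip starts that can be extended outward, so that only
            # maximal stacks are reported.
            if i > 0 and j + 1 < n and (seq[i - 1], seq[j + 1]) in PAIRS:
                continue
            length = 0
            # Extend inward while bases pair and the loop stays long enough.
            while (i + length < j - length - min_loop
                   and (seq[i + length], seq[j - length]) in PAIRS):
                length += 1
            if length >= min_len:
                stacks.append((i, j, length))
    return stacks


if __name__ == "__main__":
    print(putative_stacks("GGGAAAACCC"))
```

For the toy hairpin `GGGAAAACCC`, the only stack of length three or more pairs positions 0-2 with positions 9-7. A consensus method in the spirit of the abstract would then compare such candidate stacks across several unaligned sequences; that step is not shown here.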