Consensus shapes: an alternative to the Sankoff algorithm for RNA consensus structure prediction
Open Access
- 14 July 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (17) , 3516-3523
- https://doi.org/10.1093/bioinformatics/bti577
Abstract
Motivation: The well-known Sankoff algorithm for simultaneous RNA sequence alignment and folding is currently considered an ideal, but computationally over-expensive method. Available tools implement this algorithm under various pragmatic restrictions. They are still expensive to use, and it is difficult to judge whether the moderate quality of results is due to the underlying model or to its imperfect implementation.
Results: We propose to redefine the consensus structure prediction problem in a way that does not imply a multiple sequence alignment step. For a family of RNA sequences, our method explicitly and independently enumerates the near-optimal abstract shape space of each sequence, and predicts as the consensus an abstract shape common to all sequences. For each sequence, it delivers the thermodynamically best structure that has this common shape. Since the shape space is much smaller than the structure space, and identification of common shapes can be done in linear time (in the number of shapes considered), the method is essentially linear in the number of sequences. Our evaluation shows that the new method compares favorably with available alternatives.
Availability: The new method has been implemented in the program RNAcast and is available on the Bielefeld Bioinformatics Server.
Contact: jreeder@TechFak.Uni-Bielefeld.DE, robert@TechFak.Uni-Bielefeld.DE
Supplementary information: Available at http://bibiserv.techfak.uni-bielefeld.de/rnacast/supplementary.html
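The consensus step described in the abstract can be illustrated with a minimal sketch. It is not the RNAcast implementation. It assumes that the near-optimal shapes of each sequence have already been enumerated by some other tool and are given as a table mapping each shape string to the free energy and dot-bracket string of the best structure with that shape. The names `ShapeTable` and `consensus_shape`, the toy energies and structures, and the rule of ranking common shapes by summed free energy are all assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical input format: shape string -> (free energy in kcal/mol,
# dot-bracket string of the best structure having that shape).
ShapeTable = Dict[str, Tuple[float, str]]


def consensus_shape(
    tables: List[ShapeTable],
) -> Optional[Tuple[str, float, List[str]]]:
    """Return (shape, summed energy, one structure per sequence), or None
    if no shape occurs in every table."""
    if not tables:
        return None

    # A single pass over all shapes of all sequences: the work is linear
    # in the total number of shapes considered.
    counts: Dict[str, int] = {}
    totals: Dict[str, float] = {}
    for table in tables:
        for shape, (energy, _structure) in table.items():
            counts[shape] = counts.get(shape, 0) + 1
            totals[shape] = totals.get(shape, 0.0) + energy

    # A shape is a consensus candidate only if every sequence has it.
    common = [shape for shape, n in counts.items() if n == len(tables)]
    if not common:
        return None

    # Assumed ranking rule: prefer the lowest summed free energy.
    best = min(common, key=lambda shape: totals[shape])
    return best, totals[best], [table[best][1] for table in tables]


# Toy data, invented for this example (not taken from the paper).
example_tables: List[ShapeTable] = [
    {"[]": (-5.0, "((((......))))"), "[][]": (-3.0, "((...))((...))")},
    {"[][]": (-6.0, "((...))((...))"), "[]": (-3.5, "(((........)))")},
    {"[[][]]": (-5.5, "((...)(...)..)"), "[][]": (-4.0, "((...)).(...).")},
]

if __name__ == "__main__":
    print(consensus_shape(example_tables))
```

In the toy data, `[]` is the best shape for the first sequence but is absent from the third, so only `[][]` qualifies as a consensus. Each sequence then contributes its own best structure of that shape, which need not be its overall minimum free energy structure.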