PARTS: Probabilistic Alignment for RNA joinT Secondary structure prediction
Open Access
- 26 February 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (7) , 2406-2417
- https://doi.org/10.1093/nar/gkn043
Abstract
A novel method is presented for joint prediction of alignment and common secondary structures of two RNA sequences. The joint consideration of common secondary structures and alignment is accomplished by structural alignment over a search space defined by a newly introduced motif called matched helical regions. The matched helical region formulation generalizes previously employed constraints for structural alignment and thereby better accommodates the structural variability within RNA families. A probabilistic model based on pseudo free energies obtained from precomputed base pairing and alignment probabilities is used for scoring structural alignments. Maximum a posteriori (MAP) common secondary structures, sequence alignment and joint posterior probabilities of base pairing are obtained from the model via a dynamic programming algorithm called PARTS. The advantage of the more general structural alignment of PARTS is seen in secondary structure predictions for the RNase P family. For this family, the PARTS MAP predictions of secondary structures and alignment perform significantly better than prior methods that use a more restrictive structural alignment model. For the tRNA and 5S rRNA families, the richer structural alignment model of PARTS does not offer a benefit, and the method therefore performs comparably with existing alternatives. For all RNA families studied, the posterior probability estimates obtained from PARTS improve on the posterior probability estimates from a single sequence prediction. When considering the base pairings predicted above a confidence threshold, the combination of sensitivity and positive predictive value is better for PARTS than for the single sequence prediction. PARTS source code is available for download under the GNU public license at http://rna.urmc.rochester.edu.
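The last comparison in the abstract, sensitivity and positive predictive value (PPV) of base pairs predicted above a confidence threshold, can be illustrated with a minimal sketch. This is not the PARTS code: the function names, the toy probabilities and the toy known structure below are all invented for illustration, and pairs are assumed to count as correct only on an exact (i, j) match, which may differ from the scoring rules used in the paper.

```python
from typing import Dict, Set, Tuple

# A base pair (i, j) with i < j; 1-based nucleotide indices (an assumed convention).
Pair = Tuple[int, int]


def threshold_pairs(pair_probs: Dict[Pair, float], threshold: float) -> Set[Pair]:
    """Keep the pairs whose estimated pairing probability is at least `threshold`."""
    return {pair for pair, prob in pair_probs.items() if prob >= threshold}


def sensitivity_ppv(predicted: Set[Pair], known: Set[Pair]) -> Tuple[float, float]:
    """Return (sensitivity, PPV) under exact-match scoring.

    sensitivity = correctly predicted pairs / pairs in the known structure
    PPV         = correctly predicted pairs / all predicted pairs
    """
    true_positives = len(predicted & known)
    sensitivity = true_positives / len(known) if known else 0.0
    ppv = true_positives / len(predicted) if predicted else 0.0
    return sensitivity, ppv


# Hypothetical toy data, not taken from the paper.
known_structure: Set[Pair] = {(1, 20), (2, 19), (3, 18), (5, 15)}
pair_probs: Dict[Pair, float] = {
    (1, 20): 0.95,
    (2, 19): 0.90,
    (3, 18): 0.60,
    (4, 16): 0.55,  # not in the known structure
    (6, 14): 0.20,  # not in the known structure
}

for threshold in (0.5, 0.8):
    predicted = threshold_pairs(pair_probs, threshold)
    sens, ppv = sensitivity_ppv(predicted, known_structure)
    print(f"threshold={threshold:.1f} sensitivity={sens:.2f} PPV={ppv:.2f}")
```

With these toy numbers, raising the threshold from 0.5 to 0.8 lowers sensitivity from 0.75 to 0.50 and raises PPV from 0.75 to 1.00. That trade-off is why the two measures are reported together.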