Accurate and efficient reconstruction of deep phylogenies from structured RNAs
Open Access
- 1 September 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 37 (18) , 6184-6193
- https://doi.org/10.1093/nar/gkp600
Abstract
Ribosomal RNA (rRNA) genes are probably the most frequently used data source in phylogenetic reconstruction. Individual columns of rRNA alignments are not independent as a consequence of their highly conserved secondary structures. Unless explicitly taken into account, these correlation can distort the phylogenetic signal and/or lead to gross overestimates of tree stability. Maximum likelihood and Bayesian approaches are of course amenable to using RNA-specific substitution models that treat conserved base pairs appropriately, but require accurate secondary structure models as input. So far, however, no accurate and easy-to-use tool has been available for computing structure-aware alignments and consensus structures that can deal with the large rRNAs. The RNAsalsa approach is designed to fill this gap. Capitalizing on the improved accuracy of pairwise consensus structures and informed by a priori knowledge of group-specific structural constraints, the tool provides both alignments and consensus structures that are of sufficient accuracy for routine phylogenetic analysis based on RNA-specific substitution models. The power of the approach is demonstrated using two rRNA data sets: a mitochondrial rRNA set of 26 Mammalia, and a collection of 28S nuclear rRNAs representative of the five major echinoderm groups.Keywords
This publication has 70 references indexed in Scilit:
- RNAalifold: improved consensus structure prediction for RNA alignmentsBMC Bioinformatics, 2008
- Ab initio RNA folding by discrete molecular dynamics: From structure prediction to folding mechanismsRNA, 2008
- Improved accuracy of multiple ncRNA alignment by incorporating structural information into a MAFFT-based frameworkBMC Bioinformatics, 2008
- Strategies for measuring evolutionary conservation of RNA secondary structuresBMC Bioinformatics, 2008
- A fast structural multiple alignment method for long RNA sequencesBMC Bioinformatics, 2008
- Inferring Noncoding RNA Families and Classes by Means of Genome-Scale Structure-Based ClusteringPLoS Computational Biology, 2007
- The complete mitochondrial genomes of the sea lily Gymnocrinus richeri and the feather star Phanogenia gracilis: Signature nucleotide bias and unique nad4L gene rearrangement within crinoidsMolecular Phylogenetics and Evolution, 2006
- On the Correlation Between Composition and Site-Specific Evolutionary Rate: Implications for Phylogenetic InferenceMolecular Biology and Evolution, 2005
- Echinoderm Larvae and PhylogenyAnnual Review of Ecology and Systematics, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994