XRate: a fast prototyping, training and annotation tool for phylo-grammars
Open Access
- 3 October 2006
- journal article
- software
- Published by Springer Nature in BMC Bioinformatics
- Vol. 7 (1) , 428
- https://doi.org/10.1186/1471-2105-7-428
Abstract
Background: Recent years have seen the emergence of genome annotation methods based on the phylo-grammar, a probabilistic model combining continuous-time Markov chains and stochastic grammars. Previously, phylo-grammars have required considerable effort to implement, limiting their adoption by computational biologists. Results: We have developed an open source software tool, xrate, for working with reversible, irreversible or parametric substitution models combined with stochastic context-free grammars. xrate efficiently estimates maximum-likelihood parameters and phylogenetic trees using a novel "phylo-EM" algorithm that we describe. The grammar is specified in an external configuration file, allowing users to design new grammars, estimate rate parameters from training data and annotate multiple sequence alignments without the need to recompile code from source. We have used xrate to measure codon substitution rates and predict protein and RNA secondary structures. Conclusion: Our results demonstrate that xrate estimates biologically meaningful rates and makes predictions whose accuracy is comparable to that of more specialized tools.Keywords
This publication has 82 references indexed in Scilit:
- An RNA gene expressed during cortical development evolved rapidly in humansNature, 2006
- Identification and Classification of Conserved RNA Secondary Structures in the Human GenomePLoS Computational Biology, 2006
- Genome-Wide Identification of Human Functional DNA Using a Neutral Indel ModelPLoS Computational Biology, 2006
- Protein Molecular Function Prediction by Bayesian PhylogenomicsPLoS Computational Biology, 2005
- Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomesGenome Research, 2005
- A Combined Transmembrane Topology and Signal Peptide Prediction MethodPublished by Elsevier ,2004
- Prediction of complete gene structures in human genomic DNAJournal of Molecular Biology, 1997
- Maximum Discrimination Hidden Markov Models of Sequence ConsensusJournal of Computational Biology, 1995
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980