Performance, Accuracy, and Web Server for Evolutionary Placement of Short Sequence Reads under Maximum Likelihood
Open Access
- 23 March 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Systematic Biology
- Vol. 60 (3) , 291-302
- https://doi.org/10.1093/sysbio/syr010
Abstract
We present an evolutionary placement algorithm (EPA) and a Web server for the rapid assignment of sequence fragments (short reads) to edges of a given phylogenetic tree under the maximum-likelihood model. The accuracy of the algorithm is evaluated on several real-world data sets and compared with placement by pair-wise sequence comparison, using edit distances and BLAST. We introduce a slow and accurate as well as a fast and less accurate placement algorithm. For the slow algorithm, we develop additional heuristic techniques that yield almost the same run times as the fast version with only a small loss of accuracy. When those additional heuristics are employed, the run time of the more accurate algorithm is comparable with that of a simple BLAST search for data sets with a high number of short query sequences. Moreover, the accuracy of the EPA is significantly higher, in particular when the sample of taxa in the reference topology is sparse or inadequate. Our algorithm, which has been integrated into RAxML, therefore provides an equally fast but more accurate alternative to BLAST for tree-based inference of the evolutionary origin and composition of short sequence reads. We are also actively developing a Web server that offers a freely available service for computing read placements on trees using the EPA.
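The abstract compares the EPA against placement by pair-wise sequence comparison using edit distances. As a rough illustration of that baseline idea only, the sketch below assigns a read to the reference taxon with the smallest Levenshtein distance. It is not the paper's implementation: the function names `edit_distance` and `nearest_reference`, the toy sequences, and the assumption that reads and references cover the same region and have comparable length are all hypothetical choices made for this example.

```python
from typing import Dict, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with the classic two-row dynamic program."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # deleting the first i characters of a
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def nearest_reference(read: str, references: Dict[str, str]) -> Tuple[str, int]:
    """Return (taxon, distance) for the closest reference sequence.

    Ties are broken by dictionary insertion order. This is a hypothetical
    helper for illustration, not part of RAxML or the EPA.
    """
    scored = ((name, edit_distance(read, seq)) for name, seq in references.items())
    return min(scored, key=lambda pair: pair[1])


# Toy data (assumed, not from the paper): three reference taxa and one read.
references = {
    "taxon_A": "ACGTACGT",
    "taxon_B": "ACGTTCGA",
    "taxon_C": "TTGTACCA",
}
best_taxon, best_distance = nearest_reference("ACGTTCGG", references)
print(best_taxon, best_distance)
```

A nearest-neighbor assignment of this kind attaches a read to a single reference sequence, whereas the EPA described in the abstract assigns reads to edges of the reference tree under a maximum-likelihood model.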