Stampy: A statistical algorithm for sensitive and fast mapping of Illumina sequence reads
Top Cited Papers
- 27 October 2010
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 21 (6) , 936-939
- https://doi.org/10.1101/gr.111120.110
Abstract
High-volume sequencing of DNA and RNA is now within reach of any research laboratory and is quickly becoming established as a key research tool. In many workflows, each of the short sequences (“reads”) resulting from a sequencing run are first “mapped” (aligned) to a reference sequence to infer the read from which the genomic location derived, a challenging task because of the high data volumes and often large genomes. Existing read mapping software excel in either speed (e.g., BWA, Bowtie, ELAND) or sensitivity (e.g., Novoalign), but not in both. In addition, performance often deteriorates in the presence of sequence variation, particularly so for short insertions and deletions (indels). Here, we present a read mapper, Stampy, which uses a hybrid mapping algorithm and a detailed statistical model to achieve both speed and sensitivity, particularly when reads include sequence variation. This results in a higher useable sequence yield and improved accuracy compared to that of existing software.Keywords
This publication has 9 references indexed in Scilit:
- A map of human genome variation from population-scale sequencingNature, 2010
- Sequencing technologies — the next generationNature Reviews Genetics, 2009
- SOAP2: an improved ultrafast tool for short read alignmentBioinformatics, 2009
- Fast and accurate short read alignment with Burrows–Wheeler transformBioinformatics, 2009
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genomeGenome Biology, 2009
- RNA-Seq: a revolutionary tool for transcriptomicsNature Reviews Genetics, 2009
- Mapping short DNA sequencing reads and calling variants using mapping quality scoresGenome Research, 2008
- Pyrobayes: an improved base caller for SNP discovery in pyrosequencesNature Methods, 2008
- SNPdetector: A Software Tool for Sensitive and Accurate SNP DetectionPLoS Computational Biology, 2005