Error correction of high-throughput sequencing datasets with non-uniform coverage
Open Access
- 14 June 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (13), i137-i141
- https://doi.org/10.1093/bioinformatics/btr208
Abstract
Motivation: The continuing improvements to high-throughput sequencing (HTS) platforms have begun to unfold a myriad of new applications. As a result, error correction of sequencing reads remains an important problem. Though several tools do an excellent job of correcting datasets where the reads are sampled close to uniformly, the problem of correcting reads coming from drastically non-uniform datasets, such as those from single-cell sequencing, remains open.
Results: In this article, we develop the method Hammer for error correction without any uniformity assumptions. Hammer is based on a combination of a Hamming graph and a simple probabilistic model for sequencing errors. It is a simple and adaptable algorithm that improves on other tools on non-uniform single-cell data, while achieving comparable results on normal multi-cell data.
Availability: http://www.cs.toronto.edu/~pashadag
Contact: pmedvedev@cs.ucsd.edu
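The abstract names a Hamming graph as one of Hammer's two ingredients but does not describe the construction. As a rough illustration only, and not the authors' implementation, the sketch below builds a Hamming graph over the k-mers of a read set (k-mers are joined when they differ in at most `tau` positions), groups them into connected components, and picks a count-weighted consensus for each component. The function names, the parameters `k` and `tau`, the all-pairs comparison, and the per-position majority vote are all assumptions made for this example; in particular, the majority vote is a simple stand-in for the probabilistic error model, which the abstract does not specify.

```python
from collections import Counter
from itertools import combinations


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def kmer_counts(reads, k):
    """Count every k-mer occurring in the reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def hamming_components(kmers, tau):
    """Connected components of the graph joining k-mers within distance tau.

    Uses a naive all-pairs comparison (quadratic in the number of k-mers),
    which is fine for a toy example but not for real sequencing data.
    """
    parent = {km: km for km in kmers}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(kmers, 2):
        if hamming(a, b) <= tau:
            parent[find(a)] = find(b)

    groups = {}
    for km in kmers:
        groups.setdefault(find(km), []).append(km)
    return list(groups.values())


def consensus(component, counts):
    """Count-weighted majority base at each position (illustrative only)."""
    k = len(component[0])
    result = []
    for pos in range(k):
        votes = Counter()
        for km in component:
            votes[km[pos]] += counts[km]
        result.append(votes.most_common(1)[0][0])
    return "".join(result)


if __name__ == "__main__":
    # Toy input: five copies of one read, one copy with a single substitution.
    reads = ["ACGTACGT"] * 5 + ["ACGTACCT"]
    counts = kmer_counts(reads, k=8)
    for comp in hamming_components(list(counts), tau=1):
        print(sorted(comp), "->", consensus(comp, counts))
```

Note that this sketch never compares k-mer counts against a global coverage threshold; each component is resolved using only its own members, which is one way a method can avoid assuming uniform coverage.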