Sequence-specific error profile of Illumina sequencers
Open Access
- 14 May 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 39 (13), e90
- https://doi.org/10.1093/nar/gkr344
Abstract
We identified the sequence-specific starting positions of consecutive miscalls in the mapping of reads obtained from the Illumina Genome Analyser (GA). Detailed analysis of the miscall pattern indicated that the underlying mechanism involves sequence-specific interference of the base elongation process during sequencing. The two major sequence patterns that trigger this sequence-specific error (SSE) are: (i) inverted repeats and (ii) GGC sequences. We speculate that these sequences favor dephasing by inhibiting single-base elongation, by: (i) folding single-stranded DNA and (ii) altering enzyme preference. This phenomenon is a major cause of sequence coverage variability and of the unfavorable bias observed for population-targeted methods such as RNA-seq and ChIP-seq. Moreover, SSE is a potential cause of false single-nucleotide polymorphism (SNP) calls and also significantly hinders de novo assembly. This article highlights the importance of recognizing SSE and its underlying mechanisms in the hope of enhancing the potential usefulness of the Illumina sequencers.
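As a rough illustration of the two sequence patterns named in the abstract, the following Python sketch scans a DNA string for GGC motifs and for simple inverted repeats. It is not the authors' detection method: the function names, the arm length (`arm`), and the maximum loop length (`max_loop`) are assumptions chosen for illustration, and the abstract does not specify which strand or repeat geometry matters.

```python
from typing import List, Tuple

# Complement table for unambiguous DNA bases (N is passed through).
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A", "N": "N"}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def find_motif(seq: str, motif: str = "GGC") -> List[int]:
    """Return 0-based start positions of every (possibly overlapping) motif hit."""
    seq = seq.upper()
    hits = []
    pos = seq.find(motif)
    while pos != -1:
        hits.append(pos)
        pos = seq.find(motif, pos + 1)
    return hits


def find_inverted_repeats(
    seq: str, arm: int = 4, max_loop: int = 6
) -> List[Tuple[int, int, str]]:
    """Return (left_start, right_start, left_arm) for each inverted repeat.

    An inverted repeat here means a left arm of length `arm` followed,
    after a loop of 0..max_loop bases, by its reverse complement.
    Both parameters are illustrative assumptions, not values from the paper.
    """
    seq = seq.upper()
    n = len(seq)
    found = []
    for i in range(n - 2 * arm + 1):
        left = seq[i:i + arm]
        if any(base not in "ACGT" for base in left):
            continue  # skip arms containing ambiguous bases
        target = reverse_complement(left)
        for loop in range(max_loop + 1):
            j = i + arm + loop
            if j + arm > n:
                break
            if seq[j:j + arm] == target:
                found.append((i, j, left))
    return found
```

Calling `find_motif(read)` and `find_inverted_repeats(read)` on a reference window gives candidate positions that could then be compared against observed miscall starting positions; any such comparison would need the mapped read data, which this sketch does not model.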