Alta-Cyclic: a self-optimizing base caller for next-generation sequencing
- 6 July 2008
- journal article
- research article
- Published by Springer Nature in Nature Methods
- Vol. 5 (8) , 679-682
- https://doi.org/10.1038/nmeth.1230
Abstract
A new base caller for the Illumina Genome Analyzer uses machine learning to compensate for noise factors and improves accuracy for up to 78-base-pair sequencing reads. Next-generation sequencing is limited to short read lengths and by high error rates. We systematically analyzed sources of noise in the Illumina Genome Analyzer that contribute to these high error rates and developed a base caller, Alta-Cyclic, that uses machine learning to compensate for noise factors. Alta-Cyclic substantially improved the number of accurate reads for sequencing runs up to 78 bases and reduced systematic biases, facilitating confident identification of sequence variants.