Base-Calling of Automated Sequencer Traces UsingPhred. I. Accuracy Assessment
Open Access
- 1 March 1998
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 8 (3) , 175-185
- https://doi.org/10.1101/gr.8.3.175
Abstract
The availability of massive amounts of DNA sequence information has begun to revolutionize the practice of biology. As a result, current large-scale sequencing output, while impressive, is not adequate to keep pace with growing demand and, in particular, is far short of what will be required to obtain the 3-billion-base human genome sequence by the target date of 2005. To reach this goal, improved automation will be essential, and it is particularly important that human involvement in sequence data processing be significantly reduced or eliminated. Progress in this respect will require both improved accuracy of the data processing software and reliable accuracy measures to reduce the need for human involvement in error correction and make human review more efficient. Here, we describe one step toward that goal: a base-calling program for automated sequencer traces,phred,with improved accuracy.phredappears to be the first base-calling program to achieve a lower error rate than the ABI software, averaging 40%–50% fewer errors in the data sets examined independent of position in read, machine running conditions, or sequencing chemistry.Keywords
This publication has 14 references indexed in Scilit:
- A rapid method for determining sequences in DNA by primed synthesis with DNA polymerasePublished by Elsevier ,2004
- Base-Calling of Automated Sequencer Traces Using Phred. II. Error ProbabilitiesGenome Research, 1998
- AmpliTaq ® DNA Polymerase, FS Dye-Terminator Sequencing: Analysis of Peak Height PatternsBioTechniques, 1996
- A graph theoretic approach to the analysis of DNA sequencing data.Genome Research, 1996
- An adaptive, object oriented strategy for base calling in DNA sequence analysisNucleic Acids Research, 1993
- DNA sequencing with dye-labeled terminators and T7 DNA polymerase: effect of dyes and dNTPs on incorporation of dye-terminators and probability analysis of termination fragmentsNucleic Acids Research, 1992
- A standard file format for data from DNA sequencing instrumentsDNA Sequence, 1992
- A System for Rapid DNA Sequencing with Fluorescent Chain-Terminating DideoxynucleotidesScience, 1987
- Numerical recipes: the art of scientific computingAnalytica Chimica Acta, 1987
- DNA sequencing with chain-terminating inhibitorsProceedings of the National Academy of Sciences, 1977