Base-Calling of Automated Sequencer Traces Using Phred. II. Error Probabilities
Open Access
- 1 March 1998
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 8 (3) , 186-194
- https://doi.org/10.1101/gr.8.3.186
Abstract
Elimination of the data processing bottleneck in high-throughput sequencing will require both improved accuracy of data processing software and reliable measures of that accuracy. We have developed and implemented in our base-calling program phred the ability to estimate a probability of error for each base-call, as a function of certain parameters computed from the trace data. These error probabilities are shown here to be valid (correspond to actual error rates) and to have high power to discriminate correct base-calls from incorrect ones, for read data collected under several different chemistries and electrophoretic conditions. They play a critical role in our assembly program phrap and our finishing programconsed.Keywords
This publication has 7 references indexed in Scilit:
- Base-Calling of Automated Sequencer Traces UsingPhred. I. Accuracy AssessmentGenome Research, 1998
- Consed: A Graphical Tool for Sequence FinishingGenome Research, 1998
- AmpliTaq ® DNA Polymerase, FS Dye-Terminator Sequencing: Analysis of Peak Height PatternsBioTechniques, 1996
- A graph theoretic approach to the analysis of DNA sequencing data.Genome Research, 1996
- Assignment of position-specific error probability to primary DNA sequence dataNucleic Acids Research, 1994
- An adaptive, object oriented strategy for base calling in DNA sequence analysisNucleic Acids Research, 1993
- DNA sequencing with dye-labeled terminators and T7 DNA polymerase: effect of dyes and dNTPs on incorporation of dye-terminators and probability analysis of termination fragmentsNucleic Acids Research, 1992