An estimate of the sequencing error frequency in the DNA sequence databases
- 1 January 1992
- journal article
- research article
- Published by Taylor & Francis in DNA Sequence
- Vol. 2 (6), 343-346
- https://doi.org/10.3109/10425179209020815
Abstract
We have examined vector sequences fortuitously present in the EMBL sequence database as contaminating parts of submitted sequences, and found a sequencing error frequency of 3.55% in this subset of release 27 of the database. We discuss the possibility that this value may be representative for corresponding errors in the database as a whole.
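The abstract does not say how errors were counted, so the following is only a minimal sketch of one plausible way to compute such an error frequency. It assumes a contaminating vector fragment has already been aligned to the known vector sequence, and it counts every mismatched or gapped column as one error. The function name `error_frequency` and the toy sequences are hypothetical and are not data from the paper.

```python
def error_frequency(reference: str, observed: str) -> float:
    """Return the fraction of alignment columns where the two sequences differ.

    Assumption: both strings are pre-aligned and of equal length, with '-'
    marking a gap. Substitutions, insertions and deletions each count as
    one error per column.
    """
    if len(reference) != len(observed):
        raise ValueError("aligned sequences must have equal length")
    if not reference:
        raise ValueError("alignment is empty")
    ref = reference.upper()
    obs = observed.upper()
    # Count columns where the database entry disagrees with the known vector.
    errors = sum(1 for r, o in zip(ref, obs) if r != o)
    return errors / len(ref)


# Hypothetical toy alignment: one substitution and one deletion in 10 columns.
known_vector = "ACGTACGTAC"
database_entry = "ACGAACG-AC"
rate = error_frequency(known_vector, database_entry)
print(f"{rate:.2%}")  # prints 20.00%
```

Under these assumptions, a database-wide estimate would pool the error and column counts over all contaminating fragments rather than averaging per-fragment rates, so that long fragments are weighted in proportion to their length.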
This publication has 5 references indexed in Scilit:
- Finding DNA Sequencing Errors. Science, 1991
- Sequence errors described in GenBank: a means to determine the accuracy of DNA sequence interpretation. Nucleic Acids Research, 1989
- Improved tools for biological sequence comparison. Proceedings of the National Academy of Sciences, 1988
- A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Research, 1984
- Rapid similarity searches of nucleic acid and protein data banks. Proceedings of the National Academy of Sciences, 1983