Automated Sequence Preprocessing in a Large-Scale Sequencing Environment
Open Access
- 1 September 1998
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 8 (9) , 975-984
- https://doi.org/10.1101/gr.8.9.975
Abstract
A software system for transforming fragments from four-color fluorescence-based gel electrophoresis experiments into assembled sequence is described. It has been developed for large-scale processing of all trace data, including shotgun and finishing reads, regardless of clone origin. Design considerations are discussed in detail, as are programming implementation and graphic tools. The importance of input validation, record tracking, and use of base quality values is emphasized. Several quality analysis metrics are proposed and applied to sample results from recently sequenced clones. Such quantities prove to be a valuable aid in evaluating modifications of sequencing protocol. The system is in full production use at both the Genome Sequencing Center and the Sanger Centre, for which combined weekly production is ∼100,000 sequencing reads per week.Keywords
This publication has 16 references indexed in Scilit:
- Lane tracking software for four-color fluorescence-based electrophoretic gel images.Genome Research, 1996
- The staden sequence analysis packageMolecular Biotechnology, 1996
- NIH Launches the Final Push to Sequence the GenomeScience, 1996
- Experiment files and their application during large-scale sequencing projectsDNA Sequence, 1996
- A new DNA sequence assembly programNucleic Acids Research, 1995
- The Genome Reconstruction Manager: A Software Environment for Supporting High-Throughput DNA SequencingGenomics, 1994
- 2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegansNature, 1994
- A standard file format for data from DNA sequencing instrumentsDNA Sequence, 1992
- A trace display and editing program for data from fluorescence based sequencing machinesNucleic Acids Research, 1991
- The Human Genome Project: Past, Present, and FutureScience, 1990