A Software System for Data Analysis in Automated DNA Sequencing
Open Access
- 1 June 1998
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 8 (6) , 644-665
- https://doi.org/10.1101/gr.8.6.644
Abstract
Software for gel image analysis and base-calling in fluorescence-based sequencing consisting of two primary programs, BaseFinder and GelImager, is described. BaseFinder is a framework for trace processing, analysis, and base-calling. BaseFinder is highly extensible, allowing the addition of trace analysis and processing modules without recompilation. Powerful scripting capabilities combined with modularity and multilane handling allow the user to customize BaseFinder to virtually any type of trace processing. We have developed an extensive set of data processing and analysis modules for use with the program in fluorescence-based sequencing. GelImager is a framework for gel image manipulation. It can be used for gel visualization, lane retracking, and as a front end to the Washington University Getlanes program. The programs were designed using a cross-platform development environment, currently allowing them to run in Windows NT, Windows 95, Openstep/Mach, and Rhapsody. Work is ongoing to deploy the software on additional platforms, including Solaris, Linux, and MacOS. This software has been thoroughly tested and debugged in the analysis of >2 million bp of raw sequence data from human chromosome 19 region q13. Overall sequencing accuracy was measured using a significant subset of these data, consisting of ∼600 sequences, by comparing the individual shotgun sequences against the final assembled contigs. Also, results are reported from experiments that analyzed the accuracy of the software and two other well-known base-calling programs for sequencing the M13mp18 vector sequence.[The sequence data described in this paper have been submitted to the GenBank data library under accession no. AF025422]Keywords
This publication has 27 references indexed in Scilit:
- Fully Automated DNA Reaction and Analysis in a Fluidic Capillary InstrumentAnalytical Chemistry, 1997
- A method to determine the filter matrix in four‐dye fluorescence‐based DNA sequencingElectrophoresis, 1997
- A graph theoretic approach to the analysis of DNA sequencing data.Genome Research, 1996
- Automatic matrix determination in four dye fluorescence-based DNA sequencingElectrophoresis, 1996
- An automated film reader for DNA sequencing based on homomorphic deconvolutionIEEE Transactions on Biomedical Engineering, 1994
- Quantitative analysis of gel electrophoretograms by image analysis and least squares modelingElectrophoresis, 1993
- High-throughput DNA preparation systemGenetic Analysis: Biomolecular Engineering, 1992
- High Speed Automated DNA Sequencing in Ultrathin Slab GelsNature Biotechnology, 1992
- Development of an automated procedure for fluorescent DNA sequencingGenomics, 1990
- Mapping and Sequencing the Human Genome: How to ProceedNature Biotechnology, 1987