A new DNA sequence assembly program
- 1 January 1995
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 23 (24), 4992-4999
- https://doi.org/10.1093/nar/23.24.4992
Abstract
We describe the Genome Assembly Program (GAP), a new program for DNA sequence assembly. The program is suitable for large and small projects and for a variety of strategies, and it can handle data from a range of sequencing instruments. It retains the useful components of our previous work, but includes many novel ideas and methods. Many of these methods have been made possible by the program's completely new, and highly interactive, graphical user interface. The program provides many visual clues to the current state of a sequencing project and allows users to interact in intuitive and graphical ways with their data. The program has tools to display and manipulate the various types of data that help to solve and check difficult assemblies, particularly those in repetitive genomes. We have introduced the following new displays: the Contig Selector, the Contig Comparator, the Template Display, the Restriction Enzyme Map and the Stop Codon Map. We have also made it possible to have any number of Contig Editors and Contig Joining Editors running simultaneously, even on the same contig. The program also includes a new 'Directed Assembly' algorithm and routines for automatically detecting unfinished segments of sequence, to which it suggests experimental solutions.