CAP3: A DNA Sequence Assembly Program
Open Access
- 1 September 1999
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 9 (9), 868-877
- https://doi.org/10.1101/gr.9.9.868
Abstract
We describe the third generation of the CAP sequence assembly program. The CAP3 program includes a number of improvements and new features. The program has a capability to clip 5′ and 3′ low-quality regions of reads. It uses base quality values in computation of overlaps between reads, construction of multiple sequence alignments of reads, and generation of consensus sequences. The program also uses forward–reverse constraints to correct assembly errors and link contigs. Results of CAP3 on four BAC data sets are presented. The performance of CAP3 was compared with that of PHRAP on a number of BAC data sets. PHRAP often produces longer contigs than CAP3 whereas CAP3 often produces fewer errors in consensus sequences than PHRAP. It is easier to construct scaffolds with CAP3 than with PHRAP on low-pass data with forward–reverse constraints.
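The abstract mentions clipping of 5′ and 3′ low-quality regions but does not give the criteria CAP3 uses. As a rough illustration of the general idea only, the sketch below trims a read to the span between the first and last window of consecutive high-quality bases. The function names, the threshold `min_q`, and the `window` size are all hypothetical choices made for this example and are not taken from CAP3.

```python
def clip_low_quality_ends(quals, min_q=20, window=5):
    """Return (start, end) of the retained region, end exclusive.

    Assumed rule (not CAP3's): keep everything from the first to the
    last window of `window` consecutive bases whose quality values are
    all >= min_q. Returns (0, 0) when no such window exists.
    """
    n = len(quals)
    good_starts = [
        i for i in range(n - window + 1)
        if all(q >= min_q for q in quals[i:i + window])
    ]
    if not good_starts:
        return (0, 0)
    return (good_starts[0], good_starts[-1] + window)


def clip_read(seq, quals, min_q=20, window=5):
    """Clip both ends of a read and its quality values together."""
    if len(seq) != len(quals):
        raise ValueError("sequence and quality lists must have equal length")
    start, end = clip_low_quality_ends(quals, min_q, window)
    return seq[start:end], quals[start:end]


if __name__ == "__main__":
    seq = "ACGTACGTACGT"
    quals = [5, 8, 30, 30, 30, 30, 30, 30, 30, 9, 4, 3]
    clipped_seq, clipped_quals = clip_read(seq, quals, min_q=20, window=3)
    print(clipped_seq, clipped_quals)
```

A window-based rule is used here so that a single isolated high-quality base inside a poor region does not anchor the clip point; other rules would be equally reasonable for illustration.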