Sequence Assembly with CAFTOOLS
Open Access
- 1 March 1998
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 8 (3) , 260-267
- https://doi.org/10.1101/gr.8.3.260
Abstract
Large-scale genomic sequencing requires a software infrastructure to support and integrate applications that are not directly compatible. We describe a suite of software tools built around the Common Assembly Format (CAF), a comprehensive representation of a sequence assembly as a text file. These tools form the backbone of sequencing informatics at the Sanger Centre and the Genome Sequencing Center. The CAF format is intentionally flexible, and our Perl and C libraries, which parse and manipulate it, provide powerful tools for creating new applications as well as wrappers to incorporate other software. The tools are available free by anonymous FTP from ftp://ftp.sanger.ac.uk/pub/badger/.Keywords
This publication has 8 references indexed in Scilit:
- Base-Calling of Automated Sequencer Traces Using Phred. II. Error ProbabilitiesGenome Research, 1998
- Consed: A Graphical Tool for Sequence FinishingGenome Research, 1998
- Trace alignment and some of its applications.Bioinformatics, 1998
- Experiment files and their application during large-scale sequencing projectsDNA Sequence, 1996
- A new DNA sequence assembly programNucleic Acids Research, 1995
- A standard file format for data from DNA sequencing instrumentsDNA Sequence, 1992
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- Basic local alignment search toolJournal of Molecular Biology, 1990