Hierarchical Scaffolding With Bambus
Open Access
- 5 January 2004
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (1) , 149-159
- https://doi.org/10.1101/gr.1536204
Abstract
The output of a genome assembler generally comprises a collection of contiguous DNA sequences (contigs) whose relative placement along the genome is not defined. A procedure called scaffolding is commonly used to order and orient these contigs using paired read information. This ordering of contigs is an essential step when finishing and analyzing the data from a whole-genome shotgun project. Most recent assemblers include a scaffolding module; however, users have little control over the scaffolding algorithm or the information produced. We thus developed a general-purpose scaffolder, called Bambus, which affords users significant flexibility in controlling the scaffolding parameters. Bambus was used recently to scaffold the low-coverage draft dog genome data. Most significantly, Bambus enables the use of linking data other than that inferred from mate-pair information. For example, the sequence of a completed genome can be used to guide the scaffolding of a related organism. We present several applications of Bambus: support for finishing, comparative genomics, analysis of the haplotype structure of genomes, and scaffolding of a mammalian genome at low coverage. Bambus is available as an open-source package from our Web site.Keywords
This publication has 30 references indexed in Scilit:
- The genome sequence of Bacillus anthracis Ames and comparison to closely related bacteriaNature, 2003
- The Phusion AssemblerGenome Research, 2002
- The Draft Genome of Ciona intestinalis : Insights into Chordate and Vertebrate OriginsScience, 2002
- Whole-Genome Shotgun Assembly and Analysis of the Genome of Fugu rubripesScience, 2002
- A Whole-Genome Assembly of DrosophilaScience, 2000
- Optimized Multiplex PCR: Efficiently Closing a Whole-Genome Shotgun Sequencing ProjectGenomics, 1999
- Whole-Genome Random Sequencing and Assembly of Haemophilus influenzae RdScience, 1995
- Combinatorial algorithms for DNA sequence assemblyAlgorithmica, 1995
- TIGR Assembler: A New Tool for Assembling Large Shotgun Sequencing ProjectsGenome Science and Technology, 1995
- Nucleotide sequence of bacteriophage λ DNAJournal of Molecular Biology, 1982