Scaffolding pre-assembled contigs using SSPACE
Top Cited Papers
- 12 December 2010
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (4) , 578-579
- https://doi.org/10.1093/bioinformatics/btq683
Abstract
Summary: De novo assembly tools play a main role in reconstructing genomes from next-generation sequencing (NGS) data and usually yield a number of contigs. Using paired-read sequencing data it is possible to assess the order, distance and orientation of contigs and combine them into so-called scaffolds. Although the latter process is a crucial step in finishing genomes, scaffolding algorithms are often built-in functions in de novo assembly tools and cannot be independently controlled. We here present a new tool, called SSPACE, which is a stand-alone scaffolder of pre-assembled contigs using paired-read data. Main features are: a short runtime, multiple library input of paired-end and/or mate pair datasets and possible contig extension with unmapped sequence reads. SSPACE shows promising results on both prokaryote and eukaryote genomic testsets where the amount of initial contigs was reduced by at least 75%. Availability: www.baseclear.com/bioinformatics-tools/. Contact: walter.pirovano@baseclear.com Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 6 references indexed in Scilit:
- The sequence and de novo assembly of the giant panda genomeNature, 2009
- De novo genome sequence assembly of a filamentous fungus using Sanger, 454 and Illumina sequence dataGenome Biology, 2009
- Ultrafast and memory-efficient alignment of short DNA sequences to the human genomeGenome Biology, 2009
- ABySS: A parallel assembler for short read sequence dataGenome Research, 2009
- Assembling millions of short DNA sequences using SSAKEBioinformatics, 2006
- Hierarchical Scaffolding With BambusGenome Research, 2004