Whole-Genome Sequence Assembly for Mammalian Genomes: Arachne 2
Open Access
- 1 January 2003
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 13 (1) , 91-96
- https://doi.org/10.1101/gr.828403
Abstract
We previously described the whole-genome assembly program Arachne, presenting assemblies of simulated data for small to mid-sized genomes. Here we describe algorithmic adaptations to the program, allowing for assembly of mammalian-size genomes, and also improving the assembly of smaller genomes. Three principal changes were simultaneously made and applied to the assembly of the mouse genome, during a six-month period of development: (1) Supercontigs (scaffolds) were iteratively broken and rejoined using several criteria, yielding a 64-fold increase in length (N50), and apparent elimination of all global misjoins; (2) gaps between contigs in supercontigs were filled (partially or completely) by insertion of reads, as suggested by pairing within the supercontig, increasing the N50 contig length by 50%; (3) memory usage was reduced fourfold. The outcome of this mouse assembly and its analysis are described in (Mouse Genome Sequencing Consortium 2002).
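The N50 figures quoted in the abstract can be made concrete with a small sketch. It assumes the common definition of N50: the largest length L such that contigs (or supercontigs) of length at least L together cover at least half of the total assembled length. The function name `n50` and the example lengths are hypothetical illustrations; they are not taken from Arachne or from the mouse assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of contig or supercontig lengths.

    Assumed definition: walk the lengths from longest to shortest and
    return the length at which the running sum first reaches half of
    the total assembled length.
    """
    if not lengths:
        raise ValueError("need at least one length")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to stay in integer arithmetic.
        if 2 * running >= total:
            return length


# Made-up lengths, for illustration only.
before_joining = [2, 3, 4, 5, 6, 7, 8, 9, 10]
# Same total length after joining the three shortest pieces into one.
after_joining = [9, 5, 6, 7, 8, 9, 10]

print(n50(before_joining), n50(after_joining))
```

Because N50 depends only on the distribution of lengths, joining pieces raises it without changing the total amount of sequence, which is why it is a natural measure for the effect of rejoining supercontigs or filling gaps between contigs.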