Mauve Assembly Metrics
Open Access
- 2 August 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (19) , 2756-2757
- https://doi.org/10.1093/bioinformatics/btr451
Abstract
Summary: High-throughput DNA sequencing technologies have spurred the development of numerous novel methods for genome assembly. With few exceptions, these algorithms are heuristic and require one or more parameters to be manually set by the user. One approach to parameter tuning involves assembling data from an organism with an available high-quality reference genome, and measuring assembly accuracy using some metrics. We developed a system to measure assembly quality under several scoring metrics, and to compare assembly quality across a variety of assemblers, sequence data types, and parameter choices. When used in conjunction with training data such as a high-quality reference genome and sequence reads from the same organism, our program can be used to manually identify an optimal sequencing and assembly strategy for de novo sequencing of related organisms. Availability: GPL source code and a usage tutorial is at http://ngopt.googlecode.com Contact:aarondarling@ucdavis.edu Supplementary information: Supplementary data is available at Bioinformatics online.Keywords
This publication has 8 references indexed in Scilit:
- progressiveMauve: Multiple Genome Alignment with Gene Gain, Loss and RearrangementPLOS ONE, 2010
- The Complete Genome Sequence of Haloferax volcanii DS2, a Model ArchaeonPLOS ONE, 2010
- Reordering contigs of draft genomes using the Mauve AlignerBioinformatics, 2009
- Genome assembly forensics: finding the elusive mis-assemblyGenome Biology, 2008
- A Unifying View of Genome RearrangementsPublished by Springer Nature ,2006
- Mauve: Multiple Alignment of Conserved Genomic Sequence With RearrangementsGenome Research, 2004
- Versatile and open software for comparing large genomesGenome Biology, 2004
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997