Assemblathon 1: A competitive assessment of de novo short read assembly methods
Top Cited Papers
Open Access
- 16 September 2011
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 21 (12) , 2224-2241
- https://doi.org/10.1101/gr.126599.111
Abstract
Low-cost short read sequencing technology has revolutionized genomics, though it is only just becoming practical for the high-quality de novo assembly of a novel large genome. We describe the Assemblathon 1 competition, which aimed to comprehensively assess the state of the art in de novo assembly methods when applied to current sequencing technologies. In a collaborative effort, teams were asked to assemble a simulated Illumina HiSeq data set of an unknown, simulated diploid genome. A total of 41 assemblies from 17 different groups were received. Novel haplotype aware assessments of coverage, contiguity, structure, base calling, and copy number were made. We establish that within this benchmark: (1) It is possible to assemble the genome to a high level of coverage and accuracy, and that (2) large differences exist between the assemblies, suggesting room for further improvements in current methods. The simulated benchmark, including the correct answer, the assemblies, and the code that was used to evaluate the assemblies is now public and freely available from http://www.assemblathon.org/.This publication has 75 references indexed in Scilit:
- Comparative and demographic analysis of orang-utan genomesNature, 2011
- Limitations of next-generation genome sequence assemblyNature Methods, 2010
- Assembly algorithms for next-generation sequencing dataPublished by Elsevier ,2010
- How to map billions of short reads onto genomesNature Biotechnology, 2009
- The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus)Nature, 2008
- Bioinformatics challenges of new sequencing technologyPublished by Elsevier ,2008
- Direct electrical detection of DNA synthesisProceedings of the National Academy of Sciences, 2006
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990