Graph accordance of next-generation sequence assemblies
Open Access
- 23 October 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 28 (1), 13-16
- https://doi.org/10.1093/bioinformatics/btr588
Abstract
Motivation: No individual assembly algorithm addresses all the known limitations of assembling short-length sequences. Reduced overall contig length is the major problem limiting the use of these assemblies. We describe an algorithm that takes advantage of different assembly algorithms or sequencing platforms to improve the quality of next-generation sequence (NGS) assemblies.
Results: The algorithm is implemented as a graph accordance assembly (GAA) program. The algorithm constructs an accordance graph to capture the mapping information between the target and query assemblies. Based on the accordance graph, the contigs or scaffolds of the target assembly can be extended, merged or bridged together. Extra constraints, including gap sizes, mate pairs, scaffold order and orientation, are explored to enforce those accordance operations in the correct context. We applied GAA to various chicken NGS assemblies, and the results demonstrate improved contiguity statistics and higher genome and gene coverage.
Availability: GAA is implemented in object-oriented Perl and is available at http://sourceforge.net/projects/gaa-wugi/.
Contact: lye@genome.wustl.edu
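The idea of using a query assembly to join target contigs can be illustrated with a minimal sketch. This is not the GAA implementation (which is written in Perl) and does not reproduce its accordance graph. It assumes a simplified setting in which each target contig has already been aligned to a query contig, and it ignores orientation, mate pairs and scaffold order, all of which the abstract says GAA takes into account. The names `Alignment`, `build_graph`, `candidate_joins` and the `max_gap` threshold are hypothetical and chosen only for illustration.

```python
from collections import defaultdict
from typing import NamedTuple


class Alignment(NamedTuple):
    """Hypothetical record: one target contig aligned onto one query contig."""
    target: str   # target contig name
    query: str    # query contig name
    q_start: int  # start of the aligned region on the query (0-based)
    q_end: int    # end of the aligned region on the query (exclusive)


def build_graph(alignments):
    """Group alignments by query contig and sort them by query position."""
    graph = defaultdict(list)
    for aln in alignments:
        graph[aln.query].append(aln)
    for hits in graph.values():
        hits.sort(key=lambda a: a.q_start)
    return dict(graph)


def candidate_joins(graph, max_gap=1000):
    """Return (left_target, right_target, gap) for neighbouring target contigs.

    Two target contigs are neighbours if they are adjacent on the same
    query contig. A negative gap means the aligned regions overlap (a
    merge candidate); a non-negative gap up to max_gap (an assumed
    threshold) means the query sequence could bridge the two contigs.
    """
    joins = []
    for hits in graph.values():
        for left, right in zip(hits, hits[1:]):
            if left.target == right.target:
                continue  # same contig aligned twice; nothing to join
            gap = right.q_start - left.q_end
            if gap <= max_gap:
                joins.append((left.target, right.target, gap))
    return joins


# Toy data: three target contigs land on query contig q1, one on q2.
alignments = [
    Alignment("t2", "q1", 1200, 2000),
    Alignment("t1", "q1", 0, 1000),
    Alignment("t3", "q1", 1950, 3000),
    Alignment("t4", "q2", 0, 500),
]
for left, right, gap in candidate_joins(build_graph(alignments)):
    kind = "merge" if gap < 0 else "bridge"
    print(f"{left} -> {right}: {kind} candidate, gap {gap}")
```

In this toy input, `t1` and `t2` are separated on the query by a short gap and so form a bridge candidate, while `t2` and `t3` overlap and form a merge candidate. A real tool would also need to check strand, alignment quality and the extra constraints listed in the abstract before accepting any join.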