Parallel, tag-directed assembly of locally derived short sequence reads

Abstract
Short sequence reads are grouped based on the long genomic fragments from which they derive, enabling efficient local assembly of the long fragments and therefore accurate de novo genome assembly and metagenome sequencing. We demonstrate subassembly, an in vitro library construction method that extends the utility of short-read sequencing platforms to applications requiring long, accurate reads. A long DNA fragment library is converted to a population of nested sublibraries, and a tag sequence directs grouping of short reads derived from the same long fragment, enabling localized assembly of long fragment sequences. Subassembly may facilitate accurate de novo genome assembly and metagenome sequencing.