Unlocking Short Read Sequencing for Metagenomics
Open Access
- 28 July 2010
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 5 (7) , e11840
- https://doi.org/10.1371/journal.pone.0011840
Abstract
Several high-throughput nucleic acid sequencing platforms are available, but a trade-off exists between the cost and number of reads that can be generated and the read length that can be achieved. We describe an experimental and computational pipeline yielding millions of reads that can exceed 200 bp, with quality scores approaching those of traditional Sanger sequencing. The method combines an automatable gel-less library construction step with paired-end sequencing on a short-read instrument. With appropriately sized library inserts, mate-pair sequences can overlap, and we describe the SHERA software package that joins them to form a longer composite read. This strategy is broadly applicable to sequencing applications that benefit from low-cost, high-throughput sequencing but require longer read lengths. We demonstrate that our approach enables metagenomic analyses using the Illumina Genome Analyzer, with low error rates and at a fraction of the cost of pyrosequencing.
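The idea of joining overlapping mate-pair sequences into one composite read can be illustrated with a minimal sketch. This is not the SHERA algorithm, whose scoring and parameters are not given in the abstract. The function names, the longest-first overlap search, the `min_overlap` and `max_mismatch_frac` thresholds, and the rule for combining quality scores are all assumptions made for illustration. The only standard element is the Phred relation between a quality score Q and an error probability p, Q = -10 log10(p).

```python
# Illustrative sketch only: NOT the SHERA implementation.
# All names and thresholds below are hypothetical.

COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def phred_to_error(q):
    """Convert a Phred quality score to an error probability."""
    return 10 ** (-q / 10)


def merge_pair(fwd, fwd_q, rev, rev_q, min_overlap=10, max_mismatch_frac=0.1):
    """Join a forward read and its mate into one composite read.

    fwd_q and rev_q are lists of integer Phred scores.
    Returns (sequence, qualities), or None if no acceptable overlap exists.
    """
    # Put the mate on the same strand as the forward read.
    rc = reverse_complement(rev)
    rc_q = rev_q[::-1]

    # Assumed strategy: try the longest overlap first and accept the first
    # one whose mismatch fraction is within the threshold.
    overlap = None
    for ov in range(min(len(fwd), len(rc)), min_overlap - 1, -1):
        mismatches = sum(a != b for a, b in zip(fwd[-ov:], rc[:ov]))
        if mismatches <= max_mismatch_frac * ov:
            overlap = ov
            break
    if overlap is None:
        return None

    # Start with the part of the forward read that precedes the overlap.
    seq = list(fwd[:-overlap])
    quals = list(fwd_q[:-overlap])

    # Resolve each overlapping position.
    for i in range(overlap):
        a, qa = fwd[len(fwd) - overlap + i], fwd_q[len(fwd) - overlap + i]
        b, qb = rc[i], rc_q[i]
        if a == b:
            # Simplification: keep the higher of the two scores.
            seq.append(a)
            quals.append(max(qa, qb))
        else:
            # Keep the more confident base; lower its score by the other's.
            seq.append(a if qa >= qb else b)
            quals.append(abs(qa - qb))

    # Finish with the part of the mate that extends past the overlap.
    seq.extend(rc[overlap:])
    quals.extend(rc_q[overlap:])
    return "".join(seq), quals


if __name__ == "__main__":
    fragment = "ATGCCGTTAGACCTGAATCGGCTAAGTCCATG"  # made-up 32 bp insert
    read_len = 22
    fwd = fragment[:read_len]
    rev = reverse_complement(fragment[-read_len:])
    merged = merge_pair(fwd, [30] * read_len, rev, [30] * read_len)
    print(merged[0] == fragment)
```

In this toy case two 22 bp reads from a 32 bp insert overlap by 12 bp, so the composite read is longer than either mate. That is the effect the abstract describes for appropriately sized library inserts.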