Direct Comparisons of Illumina vs. Roche 454 Sequencing Technologies on the Same Microbial Community DNA Sample
Top Cited Papers
Open Access
- 10 February 2012
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 7 (2) , e30087
- https://doi.org/10.1371/journal.pone.0030087
Abstract
Next-generation sequencing (NGS) is commonly used in metagenomic studies of complex microbial communities but whether or not different NGS platforms recover the same diversity from a sample and their assembled sequences are of comparable quality remain unclear. We compared the two most frequently used platforms, the Roche 454 FLX Titanium and the Illumina Genome Analyzer (GA) II, on the same DNA sample obtained from a complex freshwater planktonic community. Despite the substantial differences in read length and sequencing protocols, the platforms provided a comparable view of the community sampled. For instance, derived assemblies overlapped in ∼90% of their total sequences and in situ abundances of genes and genotypes (estimated based on sequence coverage) correlated highly between the two platforms (R2>0.9). Evaluation of base-call error, frameshift frequency, and contig length suggested that Illumina offered equivalent, if not better, assemblies than Roche 454. The results from metagenomic samples were further validated against DNA samples of eighteen isolate genomes, which showed a range of genome sizes and G+C% content. We also provide quantitative estimates of the errors in gene and contig sequences assembled from datasets characterized by different levels of complexity and G+C% content. For instance, we noted that homopolymer-associated, single-base errors affected ∼1% of the protein sequences recovered in Illumina contigs of 10× coverage and 50% G+C; this frequency increased to ∼3% when non-homopolymer errors were also considered. Collectively, our results should serve as a useful practical guide for choosing proper sampling strategies and data possessing protocols for future metagenomic studies.Keywords
This publication has 30 references indexed in Scilit:
- Individual genome assembly from complex community short-read metagenomic datasetsThe ISME Journal, 2011
- Sequence-specific error profile of Illumina sequencersNucleic Acids Research, 2011
- Microbial community transcriptomes reveal microbes and metabolic pathways associated with dissolved organic matter turnover in the seaProceedings of the National Academy of Sciences, 2010
- FragGeneScan: predicting genes in short and error-prone readsNucleic Acids Research, 2010
- Impact of diet in shaping gut microbiota revealed by a comparative study in children from Europe and rural AfricaProceedings of the National Academy of Sciences, 2010
- Alta-Cyclic: a self-optimizing base caller for next-generation sequencingNature Methods, 2008
- MetaGene: prokaryotic gene finding from environmental genome shotgun sequencesNucleic Acids Research, 2006
- Multiple Sequence Alignment Using ClustalW and ClustalXCurrent Protocols in Bioinformatics, 2002
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997