Performance comparison of whole-genome sequencing platforms
- 18 December 2011
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 30 (1), 78-82
- https://doi.org/10.1038/nbt.2065
Abstract
Over 90% of human whole-genome sequencing has been performed using instruments from two companies, Illumina and Complete Genomics. Lam et al. sequence the same DNA samples with both instruments and compare their performance for calling insertions, deletions and single-nucleotide variants.

Whole-genome sequencing is becoming commonplace, but the accuracy and completeness of variant calling by the most widely used platforms from Illumina and Complete Genomics have not been reported. Here we sequenced the genome of an individual with both technologies to a high average coverage of ∼76×, and compared their performance with respect to sequence coverage and calling of single-nucleotide variants (SNVs), insertions and deletions (indels). Although 88.1% of the ∼3.7 million unique SNVs were concordant between platforms, there were tens of thousands of platform-specific calls located in genes and other genomic regions. In contrast, 26.5% of indels were concordant between platforms. Target enrichment validated 92.7% of the concordant SNVs, whereas validation by genotyping array revealed a sensitivity of 99.3%. The validation experiments also suggested that >60% of the platform-specific variants were indeed present in the genome. Our results have important implications for understanding the accuracy and completeness of the genome sequencing platforms.
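The concordance figures in the abstract can be illustrated with a small sketch. It assumes that concordance is measured as the share of the union of both call sets that is called identically by both platforms, which is one plausible reading of "88.1% of the ∼3.7 million unique SNVs". The `Variant` type, the `concordance` function, and the toy calls are hypothetical and are not taken from the paper.

```python
from typing import Iterable, NamedTuple


class Variant(NamedTuple):
    """Hypothetical minimal variant key: position plus reference and alternate alleles."""
    chrom: str
    pos: int
    ref: str
    alt: str


def concordance(calls_a: Iterable[Variant], calls_b: Iterable[Variant]) -> dict:
    """Summarize the overlap between two call sets.

    Assumption: a call is concordant when both platforms report the same
    chromosome, position, reference allele, and alternate allele.
    """
    a, b = set(calls_a), set(calls_b)
    union = a | b    # every unique call seen on either platform
    shared = a & b   # calls made by both platforms
    return {
        "union": len(union),
        "concordant": len(shared),
        "only_a": len(a - b),  # platform-specific calls for A
        "only_b": len(b - a),  # platform-specific calls for B
        "concordance_rate": len(shared) / len(union) if union else 0.0,
    }


# Toy data for illustration only; these are not real calls.
platform_a = [
    Variant("chr1", 100, "A", "G"),
    Variant("chr1", 200, "C", "T"),
    Variant("chr2", 50, "G", "A"),
]
platform_b = [
    Variant("chr1", 100, "A", "G"),
    Variant("chr1", 200, "C", "T"),
    Variant("chr3", 75, "T", "C"),
]

summary = concordance(platform_a, platform_b)
print(summary)
```

Real indel comparisons generally need more than exact matching, because the same indel can be represented at different positions; this sketch does not attempt that normalization.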