Efficient study design for next generation sequencing
- 2 March 2011
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 35 (4) , 269-277
- https://doi.org/10.1002/gepi.20575
Abstract
Next Generation Sequencing represents a powerful tool for detecting genetic variation associated with human disease. Because of the high cost of this technology, it is critical that we develop efficient study designs that consider the trade‐off between the number of subjects (n) and the coverage depth (µ). How we divide our resources between the two can greatly impact study success, particularly in pilot studies. We propose a strategy for selecting the optimal combination of n and µ for studies aimed at detecting rare variants and for studies aimed at detecting associations between rare or uncommon variants and disease. For detecting rare variants, we find the optimal coverage depth to be between 2 and 8 reads when using the likelihood ratio test. For association studies, we find the strategy of sequencing all available subjects to be preferable. In deriving these combinations, we provide a detailed analysis describing the distribution of depth across a genome and the depth needed to identify a minor allele in an individual. The optimal coverage depth depends on the aims of the study, and the chosen depth can have a large impact on study success. Genet. Epidemiol. 35: 269‐277, 2011.Keywords
This publication has 27 references indexed in Scilit:
- Accurate detection and genotyping of SNPs utilizing population sequencing dataGenome Research, 2010
- A SNP discovery method to assess variant allele probability from next-generation resequencing dataGenome Research, 2009
- Sequencing technologies — the next generationNature Reviews Genetics, 2009
- Exome sequencing identifies the cause of a mendelian disorderNature Genetics, 2009
- Bioinformatics approaches for genomics and post genomics applications of next-generation sequencingBriefings in Bioinformatics, 2009
- Evaluation of next generation sequencing platforms for population targeted sequencing studiesGenome Biology, 2009
- Statistical aspects of discerning indel-type structural variation via DNA sequence alignmentBMC Genomics, 2009
- The theory of discovering rare variants via DNA sequencingBMC Genomics, 2009
- 1000 Genomes Project Promises Closer Look at Variation in Human GenomePublished by American Medical Association (AMA) ,2008