The Next Generation of Molecular Markers From Massively Parallel Sequencing of Pooled DNA Samples
- 1 September 2010
- journal article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 186 (1) , 207-218
- https://doi.org/10.1534/genetics.110.114397
Abstract
Next generation sequencing (NGS) is about to revolutionize genetic analysis. Currently NGS techniques are mainly used to sequence individual genomes. Due to the high sequence coverage required, the costs for population-scale analyses are still too high to allow an extension to nonmodel organisms. Here, we show that NGS of pools of individuals is often more effective in SNP discovery and provides more accurate allele frequency estimates, even when taking sequencing errors into account. We modify the population genetic estimators Tajima's π and Watterson's θ to obtain unbiased estimates from NGS pooling data. Given the same sequencing effort, the resulting estimators often show a better performance than those obtained from individual sequencing. Although our analysis also shows that NGS of pools of individuals will not be preferable under all circumstances, it provides a cost-effective approach to estimate allele frequencies on a genome-wide scale.Keywords
This publication has 15 references indexed in Scilit:
- Accurate and fast methods to estimate the population mutation rate from error prone sequencesBMC Bioinformatics, 2009
- Mapping Accuracy of Short Reads from Massively Parallel Sequencing and the Implications for Quantitative Expression ProfilingPLOS ONE, 2009
- Detecting SNPs and estimating allele frequencies in clonal bacterial populations by sequencing pooled DNABioinformatics, 2009
- DNA Sudoku—harnessing high-throughput sequencing for multiplexed specimen analysisGenome Research, 2009
- Estimation of Allele Frequencies From High-Coverage Genome-Sequencing ProjectsGenetics, 2009
- Population Genetic Inference From Resequencing DataGenetics, 2009
- Recurrent Positive Selection of the Drosophila Hybrid Incompatibility Gene HmrMolecular Biology and Evolution, 2008
- Estimation of Nucleotide Diversity, Disequilibrium Coefficients, and Mutation Rates from High-Coverage Genome-Sequencing ProjectsMolecular Biology and Evolution, 2008
- Testing for Neutrality in Samples With Sequencing ErrorsGenetics, 2008
- DNA Pooling: a tool for large-scale association studiesNature Reviews Genetics, 2002