Genome‐wide significance for dense SNP and resequencing data
- 16 January 2008
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 32 (2) , 179-185
- https://doi.org/10.1002/gepi.20292
Abstract
The problem of multiple testing is an important aspect of genome‐wide association studies, and will become more important as marker densities increase. The problem has been tackled with permutation and false discovery rate procedures and with Bayes factors, but each approach faces difficulties that we briefly review. In the current context of multiple studies on different genotyping platforms, we argue for the use of truly genome‐wide significance thresholds, based on all polymorphisms whether or not typed in the study. We approximate genome‐wide significance thresholds in contemporary West African, East Asian and European populations by simulating sequence data, based on all polymorphisms as well as for a range of single nucleotide polymorphism (SNP) selection criteria. Overall we find that significance thresholds vary by a factor of >20 over the SNP selection criteria and statistical tests that we consider and can be highly dependent on sample size. We compare our results for sequence data to those derived by the HapMap Consortium and find notable differences which may be due to the small sample sizes used in the HapMap estimate. Genet. Epidemiol. 32:179–185, 2008.Keywords
This publication has 22 references indexed in Scilit:
- Sequence-Level Population Simulations Over Large Genomic RegionsGenetics, 2007
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007
- A Fast Method for Computing High-Significance Disease Association in Large Population-Based StudiesAmerican Journal of Human Genetics, 2006
- A haplotype map of the human genomeNature, 2005
- Calibrating a coalescent simulation of human genome sequence variationGenome Research, 2005
- Evaluation of Nyholt’s Procedure for Multiple Testing CorrectionHuman Heredity, 2005
- Efficient Computation of Significance Levels for Multiple Associations in Large Studies of Correlated Data, Including Genomewide Association StudiesAmerican Journal of Human Genetics, 2004
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- A high-resolution recombination map of the human genomeNature Genetics, 2002
- Genetic dissection of complex traits: guidelines for interpreting and reporting linkage resultsNature Genetics, 1995