Universal False Discovery Rate Estimation Methodology for Genome-Wide Association Studies
- 11 December 2007
- journal article
- Published by S. Karger AG in Human Heredity
- Vol. 65 (4) , 183-194
- https://doi.org/10.1159/000112365
Abstract
Genome-wide case-control association studies aim at identifying significant differential markers between sick and healthy populations. With the development of large-scale technologies allowing the genotyping of thousands of single nucleotide polymorphisms (SNPs) comes the multiple testing problem and the practical issue of selecting the most probable set of associated markers. Several False Discovery Rate (FDR) estimation methods have been developed and tuned mainly for differential gene expression studies. However they are based on hypotheses and designs that are not necessarily relevant in genetic association studies. In this article we present a universal methodology to estimate the FDR of genome-wide association results. It uses a single global probability value per SNP and is applicable in practice for any study design, using any statistic. We have benchmarked this algorithm on simulated data and shown that it outperforms previous methods in cases requiring non-parametric estimation. We exemplified the usefulness of the method by applying it to the analysis of experimental genotyping data of three Multiple Sclerosis case-control association studies.Keywords
This publication has 26 references indexed in Scilit:
- Estimatingp-values in small microarray experimentsBioinformatics, 2006
- A tutorial on statistical methods for population association studiesNature Reviews Genetics, 2006
- A Fast, Unbiased and Exact Allelic Test for Case-Control Association StudiesHuman Heredity, 2006
- Robust estimation of the false discovery rateBioinformatics, 2006
- Construction of null statistics in permutation-based multiple testing for multi-factorial microarray experimentsBioinformatics, 2006
- Complement Factor H Polymorphism in Age-Related Macular DegenerationScience, 2005
- Large-Scale Simultaneous Hypothesis TestingJournal of the American Statistical Association, 2004
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- A Direct Approach to False Discovery RatesJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Use of Unlinked Genetic Markers to Detect Population Stratification in Association StudiesAmerican Journal of Human Genetics, 1999