An efficient Monte Carlo approach to assessing statistical significance in genomic studies
Open Access
- 28 September 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (6) , 781-787
- https://doi.org/10.1093/bioinformatics/bti053
Abstract
Motivation: Multiple hypothesis testing is a common problem in genome research, particularly in microarray experiments and genomewide association studies. Failure to account for the effects of multiple comparisons would result in an abundance of false positive results. The Bonferroni correction and Holm's step-down procedure are overly conservative, whereas the permutation test is time-consuming and is restricted to simple problems. Results: We developed an efficient Monte Carlo approach to approximating the joint distribution of the test statistics along the genome. We then used the Monte Carlo distribution to evaluate the commonly used criteria for error control, such as familywise error rates and positive false discovery rates. This approach is applicable to any data structures and test statistics. Applications to simulated and real data demonstrate that the proposed approach provides accurate error control, and can be substantially more powerful than the Bonferroni and Holm methods, especially when the test statistics are highly correlated. Contact:lin@bios.unc.eduKeywords
This publication has 13 references indexed in Scilit:
- Assessing genomewide statistical significance in linkage studiesGenetic Epidemiology, 2004
- Rank truncated product of P‐values, with application to genomewide association scansGenetic Epidemiology, 2003
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- A Direct Approach to False Discovery RatesJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Gene-expression profiles predict survival of patients with lung adenocarcinomaNature Medicine, 2002
- Truncated product method for combiningP‐valuesGenetic Epidemiology, 2002
- The control of the false discovery rate in multiple testing under dependencyThe Annals of Statistics, 2001
- The Robust Inference for the Cox Proportional Hazards ModelJournal of the American Statistical Association, 1989
- Two-Sample Asymptotically Distribution-Free Tests for Incomplete Multivariate ObservationsJournal of the American Statistical Association, 1984
- On closed testing procedures with special reference to ordered analysis of varianceBiometrika, 1976