Sample size determination for the false discovery rate
Open Access
- 4 October 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (23) , 4263-4271
- https://doi.org/10.1093/bioinformatics/bti699
Abstract
Motivation: There is not a widely applicable method to determine the sample size for experiments basing statistical significance on the false discovery rate (FDR). Results: We propose and develop the anticipated FDR (aFDR) as a conceptual tool for determining sample size. We derive mathematical expressions for the aFDR and anticipated average statistical power. These expressions are used to develop a general algorithm to determine sample size. We provide specific details on how to implement the algorithm for a k-group (k ≥ 2) comparisons. The algorithm performs well for k-group comparisons in a series of traditional simulations and in a real-data simulation conducted by resampling from a large, publicly available dataset. Availability: Documented S-plus and R code libraries are freely available from Contact:stanley.pounds@stjude.org Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 28 references indexed in Scilit:
- Practical FDR-based sample size calculations in microarray experimentsBioinformatics, 2005
- Sample size for FDR-control in microarray data analysisBioinformatics, 2005
- Optimal Sample Size for Multiple TestingJournal of the American Statistical Association, 2004
- Improving false discovery rate estimationBioinformatics, 2004
- Statistical significance for genomewide studiesProceedings of the National Academy of Sciences, 2003
- Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-valuesBioinformatics, 2003
- Identifying differentially expressed genes using false discovery rate controlling proceduresBioinformatics, 2003
- A Direct Approach to False Discovery RatesJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- The control of the false discovery rate in multiple testing under dependencyThe Annals of Statistics, 2001
- On the Adaptive Control of the False Discovery Rate in Multiple Testing With Independent StatisticsJournal of Educational and Behavioral Statistics, 2000