Sample size for FDR-control in microarray data analysis
Open Access
- 21 April 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (14) , 3097-3104
- https://doi.org/10.1093/bioinformatics/bti456
Abstract
Summary: We consider identifying differentially expressing genes between two patient groups using microarray experiment. We propose a sample size calculation method for a specified number of true rejections while controlling the false discovery rate at a desired level. Input parameters for the sample size calculation include the allocation proportion in each group, the number of genes in each array, the number of differentially expressing genes and the effect sizes among the differentially expressing genes. We have a closed-form sample size formula if the projected effect sizes are equal among differentially expressing genes. Otherwise, our method requires a numerical method to solve an equation. Simulation studies are conducted to show that the calculated sample sizes are accurate in practical settings. The proposed method is demonstrated with a realstudy. Contact:jung005@mc.duke.eduKeywords
This publication has 16 references indexed in Scilit:
- Sample size calculation for multiple testing in microarray data analysisBiostatistics, 2004
- Optimal Sample Size for Multiple TestingJournal of the American Statistical Association, 2004
- Power and sample size estimation in high dimensional biologyStatistical Methods in Medical Research, 2004
- Multiple Hypothesis Testing in Microarray ExperimentsStatistical Science, 2003
- Calculation of the minimum number of replicate spots required for detection of significant gene expression fold change in microarray experimentsBioinformatics, 2002
- Power and sample size for DNA microarray studiesStatistics in Medicine, 2002
- Operating Characteristics and Extensions of the False Discovery Rate ProcedureJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Design of studies using DNA microarraysGenetic Epidemiology, 2002
- The control of the false discovery rate in multiple testing under dependencyThe Annals of Statistics, 2001
- Molecular Classification of Cancer: Class Discovery and Class Prediction by Gene Expression MonitoringScience, 1999