Practical FDR-based sample size calculations in microarray experiments
Open Access
- 2 June 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (15) , 3264-3272
- https://doi.org/10.1093/bioinformatics/bti519
Abstract
Motivation: Owing to the experimental cost and difficulty in obtaining biological materials, it is essential to consider appropriate sample sizes in microarray studies. With the growing use of the False Discovery Rate (FDR) in microarray analysis, an FDR-based sample size calculation is essential. Method: We describe an approach to explicitly connect the sample size to the FDR and the number of differentially expressed genes to be detected. The method fits parametric models for degree of differential expression using the Expectation–Maximization algorithm. Results: The applicability of the method is illustrated with simulations and studies of a lung microarray dataset. We propose to use a small training set or published data from relevant biological settings to calculate the sample size of an experiment. Availability: Code to implement the method in the statistical package R is available from the authors. Contact:jhu@mdanderson.orgKeywords
This publication has 18 references indexed in Scilit:
- A mixture model for estimating the local false discovery rate in DNA microarray analysisBioinformatics, 2004
- The positive false discovery rate: a Bayesian interpretation and the q-valueThe Annals of Statistics, 2003
- A mixture model approach to detecting differentially expressed genes with microarray dataFunctional & Integrative Genomics, 2003
- Identifying differentially expressed genes using false discovery rate controlling proceduresBioinformatics, 2003
- Power and sample size for DNA microarray studiesStatistics in Medicine, 2002
- A Statistical Framework for Expression-Based Molecular Classification in CancerJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Significance analysis of microarrays applied to the ionizing radiation responseProceedings of the National Academy of Sciences, 2001
- Testing for Differentially-Expressed Genes by Maximum-Likelihood Analysis of Microarray DataJournal of Computational Biology, 2000
- Ratio-based decisions and the quantitative analysis of cDNA microarray imagesJournal of Biomedical Optics, 1997
- THE GENERALIZATION OF ‘STUDENT'S’ PROBLEM WHEN SEVERAL DIFFERENT POPULATION VARLANCES ARE INVOLVEDBiometrika, 1947