Effect of pooling samples on the efficiency of comparative studies using microarrays
Open Access
- 18 October 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (24) , 4378-4383
- https://doi.org/10.1093/bioinformatics/bti717
Abstract
Motivation: Many biomedical experiments are carried out by pooling individual biological samples. However, pooling samples can potentially hide biological variance and give false confidence concerning the data significance. In the context of microarray experiments for detecting differentially expressed genes, recent publications have addressed the problem of the efficiency of sample pooling, and some approximate formulas were provided for the power and sample size calculations. It is desirable to have exact formulas for these calculations and have the approximate results checked against the exact ones. We show that the difference between the approximate and the exact results can be large.
Results: In this study, we have characterized quantitatively the effect of pooling samples on the efficiency of microarray experiments for the detection of differential gene expression between two classes. We present exact formulas for calculating the power of microarray experimental designs involving sample pooling and technical replications. The formulas can be used to determine the total number of arrays and biological subjects required in an experiment to achieve the desired power at a given significance level. The conditions under which pooled design becomes preferable to non-pooled design can then be derived given the unit cost associated with a microarray and that with a biological subject. This paper thus serves to provide guidance on sample pooling and cost-effectiveness. The formulation in this paper is outlined in the context of performing microarray comparative studies, but its applicability is not limited to microarray experiments. It is also applicable to a wide range of biomedical comparative studies where sample pooling may be involved.
Availability: A Java Webstart application can be accessed at
Contact: sdz1@le.ac.uk; twg1@le.ac.uk
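The trade-off between arrays and biological subjects described in the abstract can be illustrated with a minimal sketch. It does not reproduce the paper's exact power formulas, which are not given here. It assumes a commonly used additive variance model: each subject contributes biological variance `sigma_b2`, each array adds technical variance `sigma_e2`, and a pool behaves like the average of its subjects on the analysis scale. The function names, parameter names, and the numbers in the usage note are hypothetical.

```python
def class_mean_variance(sigma_b2, sigma_e2, n_pools,
                        subjects_per_pool=1, arrays_per_pool=1):
    """Variance of the estimated mean expression for one class.

    Assumed model (not taken from the paper): pooling r subjects divides
    the biological variance by r, and m technical replicate arrays per
    pool divide the technical variance by m.
    """
    per_pool = sigma_b2 / subjects_per_pool + sigma_e2 / arrays_per_pool
    return per_pool / n_pools


def design_cost(n_pools, subjects_per_pool, arrays_per_pool,
                cost_subject, cost_array, n_classes=2):
    """Total cost of a design with n_pools pools in each class."""
    subjects = n_classes * n_pools * subjects_per_pool
    arrays = n_classes * n_pools * arrays_per_pool
    return subjects * cost_subject + arrays * cost_array


def error_degrees_of_freedom(n_pools, n_classes=2):
    """Degrees of freedom for a two-class t-test on pool-level means."""
    return n_classes * n_pools - n_classes
```

As a usage illustration with made-up inputs, compare 12 unpooled subjects per class (`n_pools=12`) with 6 pools of 3 subjects per class (`n_pools=6, subjects_per_pool=3`), taking `sigma_b2=0.04`, `sigma_e2=0.01`, a subject cost of 50 and an array cost of 500. Under the assumed model the pooled design has a slightly smaller variance and a lower cost, but it also has fewer error degrees of freedom (10 rather than 22). A variance comparison alone therefore does not settle which design has more power, which is one reason an exact power calculation is preferable to an approximation.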
This publication has 17 references indexed in Scilit:
- Pooling samples within microarray studies: a comparative analysis of rat liver transcription response to prototypical toxicants. Physiological Genomics, 2005
- A practical false discovery rate approach to identifying patterns of differential expression in microarray data. Bioinformatics, 2005
- Effects of pooling mRNA in microarray class comparisons. Bioinformatics, 2004
- A statistical framework for the design of microarray experiments and effective detection of differential gene expression. Bioinformatics, 2004
- Large-Scale Simultaneous Hypothesis Testing. Journal of the American Statistical Association, 2004
- Statistical significance for genomewide studies. Proceedings of the National Academy of Sciences, 2003
- Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values. Bioinformatics, 2003
- The efficiency of pooling mRNA in microarray experiments. Biostatistics, 2003
- Regulatory defects in liver and intestine implicate abnormal hepcidin and Cybrd1 expression in mouse hemochromatosis. Nature Genetics, 2003
- The Efficiency of Pooling in the Detection of Rare Mutations. American Journal of Human Genetics, 2000