Frozen robust multiarray analysis (fRMA)
Open Access
- 22 January 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Biostatistics
- Vol. 11 (2) , 242-253
- https://doi.org/10.1093/biostatistics/kxp059
Abstract
Robust multiarray analysis (RMA) is the most widely used preprocessing algorithm for Affymetrix and Nimblegen gene expression microarrays. RMA performs background correction, normalization, and summarization in a modular way. The last 2 steps require multiple arrays to be analyzed simultaneously. The ability to borrow information across samples provides RMA various advantages. For example, the summarization step fits a parametric model that accounts for probe effects, assumed to be fixed across arrays, and improves outlier detection. Residuals, obtained from the fitted model, permit the creation of useful quality metrics. However, the dependence on multiple arrays has 2 drawbacks: (1) RMA cannot be used in clinical settings where samples must be processed individually or in small batches and (2) data sets preprocessed separately are not comparable. We propose a preprocessing algorithm, frozen RMA (fRMA), which allows one to analyze microarrays individually or in small batches and then combine the data for analysis. This is accomplished by utilizing information from the large publicly available microarray databases. In particular, estimates of probe-specific effects and variances are precomputed and frozen. Then, with new data sets, these are used in concert with information from the new arrays to normalize and summarize the data. We find that fRMA is comparable to RMA when the data are analyzed as a single batch and outperforms RMA when analyzing multiple batches. The methods described here are implemented in the R package fRMA and are currently available for download from the software section of http://rafalab.jhsph.edu.
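The idea of preprocessing a single array against frozen parameters can be illustrated with a minimal Python sketch. It is a simplified illustration under stated assumptions, not the fRMA implementation: the function names, the toy numbers, and the use of a plain inverse-variance weighted mean in place of the paper's robust summarization are all assumptions made for this example. The frozen reference distribution, probe effects, and probe variances stand in for quantities that would be precomputed from a large public database.

```python
import numpy as np

def normalize_to_reference(log_intensities, reference_quantiles):
    """Quantile-normalize ONE array against a frozen reference distribution.

    Assumes no tied intensities and that the reference has one value per probe.
    """
    ranks = np.argsort(np.argsort(log_intensities))  # rank of each probe, 0 = smallest
    return np.sort(reference_quantiles)[ranks]

def summarize_probeset(normalized, probe_effects, probe_variances):
    """Subtract frozen probe effects, then average with inverse-variance weights.

    This weighted mean is a simplification chosen for the sketch; it is not
    the robust estimator described in the paper.
    """
    adjusted = normalized - probe_effects
    return float(np.average(adjusted, weights=1.0 / probe_variances))

# Hypothetical frozen parameters (in practice, estimated once from many arrays).
reference_quantiles = np.array([6.0, 7.0, 8.0, 9.0, 10.0, 11.0])
probe_effects = np.array([0.5, -1.0, 0.5, -0.5, 0.0, 0.5])
probe_variances = np.array([0.1, 0.2, 0.4, 0.5, 0.5, 0.5])
probeset_ids = np.array([0, 0, 0, 1, 1, 1])  # two probesets of three probes each

# A single new array of log2 intensities, processed without any other arrays.
new_array = np.array([9.5, 6.2, 12.1, 7.7, 8.8, 10.4])

normalized = normalize_to_reference(new_array, reference_quantiles)
expression = {
    int(ps): summarize_probeset(
        normalized[probeset_ids == ps],
        probe_effects[probeset_ids == ps],
        probe_variances[probeset_ids == ps],
    )
    for ps in np.unique(probeset_ids)
}
print(expression)
```

Because every parameter other than the new array's own intensities is fixed in advance, arrays processed this way at different times remain on a common scale, which is the property the abstract attributes to freezing.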