Evaluation of normalization procedures for oligonucleotide array data based on spiked cRNA controls
Open Access
- 21 November 2001
- journal article
- research article
- Published by Springer Nature in Genome Biology
Abstract
Affymetrix oligonucleotide arrays simultaneously measure the abundances of thousands of mRNAs in biological samples. Comparability of array results is necessary for the creation of large-scale gene expression databases. The standard strategy for normalizing oligonucleotide array readouts has practical drawbacks. We describe alternative normalization procedures for oligonucleotide arrays based on a common pool of known biotin-labeled cRNAs spiked into each hybridization. We first explore the conditions for validity of the 'constant mean assumption', the key assumption underlying current normalization methods. We introduce 'frequency normalization', a 'spike-in'-based normalization method which estimates array sensitivity, reduces background noise and allows comparison between array designs. This approach does not rely on the constant mean assumption and so can be effective in conditions where standard procedures fail. We also define 'scaled frequency', a hybrid normalization method relying on both spiked transcripts and the constant mean assumption while maintaining all other advantages of frequency normalization. We compare these two procedures to a standard global normalization method using experimental data. We also use simulated data to estimate accuracy and investigate the effects of noise. We find that scaled frequency is as reproducible and accurate as global normalization while offering several practical advantages. Scaled frequency quantitation is a convenient, reproducible technique that performs as well as global normalization on serial experiments with the same array design, while offering several additional features. Specifically, the scaled-frequency method enables the comparison of expression measurements across different array designs, yields estimates of absolute message abundance in cRNA and determines the sensitivity of individual arrays.
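The contrast between the two families of methods can be illustrated with a toy calculation. The sketch below is not the paper's procedure: the abstract does not give the formulas, so the through-origin least-squares calibration, the function names `global_normalize` and `spike_calibrate`, and all numbers are assumptions chosen only for illustration. It shows a global method that rescales each array to a common mean, and a spike-based method that converts signal to abundance units using controls of known concentration.

```python
import numpy as np

def global_normalize(signal, target_mean=100.0):
    """Rescale one array so its mean signal equals target_mean.

    This relies on the constant mean assumption: every sample is
    presumed to have the same average message abundance.
    """
    signal = np.asarray(signal, dtype=float)
    return signal * (target_mean / signal.mean())

def spike_calibrate(signal, spike_signal, spike_conc):
    """Convert signal to abundance units using spiked controls.

    Hypothetical calibration (an assumption, not the paper's formula):
    a least-squares line through the origin, conc ~ k * signal,
    fitted on the spiked transcripts of the same array.
    """
    spike_signal = np.asarray(spike_signal, dtype=float)
    spike_conc = np.asarray(spike_conc, dtype=float)
    k = np.dot(spike_signal, spike_conc) / np.dot(spike_signal, spike_signal)
    return np.asarray(signal, dtype=float) * k

# Known abundances of the spiked controls (arbitrary units, made up).
spike_conc = [1.0, 2.0, 4.0, 8.0]

# Arrays A and B see the same sample, but B is twice as sensitive.
genes_a, spikes_a = [30.0, 50.0, 70.0], [10.0, 20.0, 40.0, 80.0]
genes_b, spikes_b = [60.0, 100.0, 140.0], [20.0, 40.0, 80.0, 160.0]

# Array C has the sensitivity of A, but every message in the sample
# is twice as abundant, which violates the constant mean assumption.
genes_c, spikes_c = [60.0, 100.0, 140.0], [10.0, 20.0, 40.0, 80.0]

freq_a = spike_calibrate(genes_a, spikes_a, spike_conc)
freq_b = spike_calibrate(genes_b, spikes_b, spike_conc)
freq_c = spike_calibrate(genes_c, spikes_c, spike_conc)

glob_a = global_normalize(genes_a)
glob_b = global_normalize(genes_b)
glob_c = global_normalize(genes_c)

print("spike-calibrated:", freq_a, freq_b, freq_c)
print("globally normalized:", glob_a, glob_b, glob_c)
```

In this toy setup both approaches remove the sensitivity difference between arrays A and B. For array C, global normalization returns the same values as for A and so hides the sample-wide doubling, whereas the spike-based calibration recovers it, because the spiked controls are measured independently of the sample's overall abundance.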