A new non-linear normalization method for reducing variability in DNA microarray experiments
Top Cited Papers
Open Access
- 30 August 2002
- journal article
- Published by Springer Nature in Genome Biology
Abstract
Microarray data are subject to multiple sources of variation, of which biological sources are of interest whereas most others are only confounding. Recent work has identified systematic sources of variation that are intensity-dependent and non-linear in nature. Systematic sources of variation are not limited to the differing properties of the cyanine dyes Cy(5) and Cy(3) as observed in cDNA arrays, but are the general case for both oligonucleotide microarray (Affymetrix GeneChips) and cDNA microarray data. Current normalization techniques are most often linear and therefore not capable of fully correcting for these effects. We present here a simple and robust non-linear method for normalization using array signal distribution analysis and cubic splines. These methods compared favorably to normalization using robust local-linear regression (lowess). The application of these methods to oligonucleotide arrays reduced the relative error between replicates by 5-10% compared with a standard global normalization method. Application to cDNA arrays showed improvements over the standard method and over Cy(3)-Cy(5) normalization based on dye-swap replication. In addition, a set of known differentially regulated genes was ranked higher by the t-test. In either cDNA or Affymetrix technology, signal-dependent bias was more than ten times greater than the observed print-tip or spatial effects. Intensity-dependent normalization is important for both high-density oligonucleotide array and cDNA array data. Both the regression and spline-based methods described here performed better than existing linear methods when assessed on the variability of replicate arrays. Dye-swap normalization was less effective at Cy(3)-Cy(5) normalization than either regression or spline-based methods alone.Keywords
This publication has 17 references indexed in Scilit:
- Normalization for cDNA microarray data: a robust composite method addressing single and multiple slide systematic variationNucleic Acids Research, 2002
- Issues in cDNA microarray analysis: quality filtering, channel normalization, models of variations and assessment of gene effectsNucleic Acids Research, 2001
- Expression profiling reveals fundamental biological differences in acute myeloid leukemia with isolated trisomy 8 and normal cytogeneticsProceedings of the National Academy of Sciences, 2001
- Feature extraction and normalization algorithms for high-density oligonucleotide gene expression array dataJournal of Cellular Biochemistry, 2001
- Model-based analysis of oligonucleotide arrays: Expression index computation and outlier detectionProceedings of the National Academy of Sciences, 2000
- Model-based analysis of oligonucleotide arrays: Expression index computation and outlier detectionProceedings of the National Academy of Sciences, 2000
- Analysis of Variance for Gene Expression Microarray DataJournal of Computational Biology, 2000
- Manifold anomalies in gene expression in a vineyard isolate of Saccharomyces cerevisiae revealed by DNA microarray analysisProceedings of the National Academy of Sciences, 2000
- Genome-wide analysis of DNA copy number variation in breast cancer using DNA microarraysNature Genetics, 1999
- Functional analysis of the Bacillus subtilis purT gene encoding formate-dependent glycinamide ribonucleotide transformylaseMicrobiology, 1995