Quantile smoothing of array CGH data
- 30 November 2004
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (7), 1146-1153
- https://doi.org/10.1093/bioinformatics/bti148
Abstract
Motivation: Plots of array Comparative Genomic Hybridization (CGH) data often show special patterns: stretches of constant level (copy number) with sharp jumps between them. There can also be much noise. Classic smoothing algorithms do not work well, because they introduce too much rounding. To remedy this, we introduce a fast and effective smoothing algorithm based on penalized quantile regression. It can compute arbitrary quantile curves, but we concentrate on the median to show the trend and the lower and upper quartile curves showing the spread of the data. Two-fold cross-validation is used for optimizing the weight of the penalties.
Results: Simulated data and a published dataset are used to show the capabilities of the method to detect the segments of changed copy numbers in array CGH data.
Availability: Software for R and Matlab is available.
Contact: p.eilers@lumc.nl
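The abstract names penalized quantile regression but does not spell out the objective. As a minimal sketch, assume the objective is the quantile (check) loss of the residuals plus a weight `lam` times the sum of absolute first differences of the fitted curve; the penalty form, the function names and the toy data below are illustrative assumptions, not taken from the abstract. The example only evaluates that objective for two candidate fits, to suggest why an absolute-value penalty need not round off a sharp jump.

```python
def check_loss(residual, tau):
    """Quantile (check) loss: tau * r for r >= 0, (tau - 1) * r otherwise."""
    return tau * residual if residual >= 0 else (tau - 1.0) * residual


def penalized_objective(y, z, tau, lam):
    """Assumed objective: check loss of y - z plus lam * sum |z[i] - z[i-1]|."""
    fidelity = sum(check_loss(yi - zi, tau) for yi, zi in zip(y, z))
    roughness = sum(abs(b - a) for a, b in zip(z, z[1:]))
    return fidelity + lam * roughness


# Hypothetical noise-free profile with one jump in level.
y = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]

step = list(y)                          # candidate fit that keeps the sharp jump
ramp = [0.0, 0.0, 0.25, 0.75, 1.0, 1.0]  # candidate fit that rounds the jump

tau = 0.5   # median curve
lam = 2.0   # penalty weight (would be tuned, e.g. by cross-validation)

obj_step = penalized_objective(y, step, tau, lam)
obj_ramp = penalized_objective(y, ramp, tau, lam)
print(obj_step, obj_ramp)
```

Under this assumed penalty, the step and the ramp have the same roughness (both rise by 1 in total), so the ramp gains nothing from rounding the jump and only pays extra in the fidelity term. Setting `tau` to 0.25 or 0.75 would evaluate the same objective for the lower or upper quartile curve.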