PLASQ: a generalized linear model-based procedure to determine allelic dosage in cancer cells from SNP array data
Open Access
- 20 June 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Biostatistics
- Vol. 8 (2) , 323-336
- https://doi.org/10.1093/biostatistics/kxl012
Abstract
Human cancer is largely driven by the acquisition of mutations. One class of such mutations is copy number polymorphisms, comprising deviations from the normal diploid two copies of each autosomal chromosome per cell. We describe a probe-level allele-specific quantitation (PLASQ) procedure to determine copy number contributions from each of the parental chromosomes in cancer cells from single-nucleotide polymorphism (SNP) microarray data. Our approach is based upon a generalized linear model that takes advantage of a novel classification of probes on the array. As a result of this classification, we are able to fit the model to the data using an expectation-maximization algorithm designed for the purpose. We demonstrate a strong model fit to data from a variety of cell types. In normal diploid samples, PLASQ is able to genotype with very high accuracy. Moreover, we are able to provide a generalized genotype in cancer samples (e.g. CCCCT at an amplified SNP). Our approach is illustrated on a variety of lung cancer cell lines and tumors, and a number of events are validated by independent computational and experimental means. An R software package containing the methods is freely available.
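To make the notion of a "generalized genotype" concrete, the following is a minimal Python sketch of how allele-specific copy numbers could be spelled out as a string such as CCCCT. The function name `generalized_genotype` and its parameters are hypothetical and are not part of the PLASQ R package; the sketch assumes integer copy numbers have already been estimated and does not reproduce the model fitting itself.

```python
def generalized_genotype(allele_a: str, allele_b: str,
                         copies_a: int, copies_b: int) -> str:
    """Spell out a generalized genotype from allele-specific copy numbers.

    Hypothetical helper for illustration only: it assumes the copy number
    of each allele is already known as a non-negative integer.
    """
    if copies_a < 0 or copies_b < 0:
        raise ValueError("copy numbers must be non-negative")
    # Repeat each allele letter once per copy, first allele first.
    return allele_a * copies_a + allele_b * copies_b


print(generalized_genotype("C", "T", 4, 1))  # amplified SNP: CCCCT
print(generalized_genotype("C", "T", 1, 1))  # normal heterozygous SNP: CT
print(generalized_genotype("C", "T", 2, 0))  # normal homozygous SNP: CC
```

Under this reading, an ordinary diploid genotype is simply the special case in which the two copy numbers sum to two.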