wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data
Open Access
- 11 March 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 36 (7) , e41
- https://doi.org/10.1093/nar/gkn110
Abstract
Copy number variants (CNVs) are currently defined as genomic sequences that are polymorphic in copy number and range in length from 1000 to several million base pairs. Among current array-based CNV detection platforms, long-oligonucleotide arrays promise the highest resolution. However, the performance of currently available analytical tools suffers when applied to these data because of the lower signal:noise ratio inherent in oligonucleotide-based hybridization assays. We have developed wuHMM, an algorithm for mapping CNVs from array comparative genomic hybridization (aCGH) platforms comprised of 385 000 to more than 3 million probes. wuHMM is unique in that it can utilize sequence divergence information to reduce the false positive rate (FPR). We apply wuHMM to 385K-aCGH, 2.1M-aCGH and 3.1M-aCGH experiments comparing the 129X1/SvJ and C57BL/6J inbred mouse genomes. We assess wuHMM's performance on the 385K platform by comparison to the higher resolution platforms and we independently validate 10 CNVs. The method requires no training data and is robust with respect to changes in algorithm parameters. At a FPR of <10%, the algorithm can detect CNVs with five probes on the 385K platform and three on the 2.1M and 3.1M platforms, resulting in effective resolutions of 24 kb, 2–5 kb and 1 kb, respectively.Keywords
This publication has 41 references indexed in Scilit:
- A sequence-based variation map of 8.27 million SNPs in inbred mouse strainsNature, 2007
- Methods and strategies for analyzing copy number variation using DNA microarraysNature Genetics, 2007
- Systematic prediction and validation of breakpoints associated with copy-number variants in the human genomeProceedings of the National Academy of Sciences, 2007
- A Comprehensive Analysis of Common Copy-Number Variations in the Human GenomeAmerican Journal of Human Genetics, 2007
- Mouse Phenome Database (MPD)Nucleic Acids Research, 2006
- Global variation in copy number in the human genomeNature, 2006
- A Chromosome 8 Gene-Cluster Polymorphism with Low Human Beta-Defensin 2 Gene Copy Number Predisposes to Crohn Disease of the ColonAmerican Journal of Human Genetics, 2006
- Hotspots for copy number variation in chimpanzees and humansProceedings of the National Academy of Sciences, 2006
- Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humansNature, 2006
- BAC to the future! or oligonucleotides: a perspective for micro array comparative genomic hybridization (array CGH)Nucleic Acids Research, 2006