ReadDepth: A Parallel R Package for Detecting Copy Number Alterations from Short Sequencing Reads
Open Access
- 31 January 2011
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLOS ONE
- Vol. 6 (1) , e16327
- https://doi.org/10.1371/journal.pone.0016327
Abstract
Copy number alterations are important contributors to many genetic diseases, including cancer. We present the readDepth package for R, which can detect these aberrations by measuring the depth of coverage obtained by massively parallel sequencing of the genome. In addition to achieving higher accuracy than existing packages, our tool runs much faster by utilizing multi-core architectures to parallelize the processing of these large data sets. In contrast to other published methods, readDepth does not require the sequencing of a reference sample, and uses a robust statistical model that accounts for overdispersed data. It includes a method for effectively increasing the resolution obtained from low-coverage experiments by utilizing breakpoint information from paired end sequencing to do positional refinement. We also demonstrate a method for inferring copy number using reads generated by whole-genome bisulfite sequencing, thus enabling integrative study of epigenomic and copy number alterations. Finally, we apply this tool to two genomes, showing that it performs well on genomes sequenced to both low and high coverage. The readDepth package runs on Linux and MacOSX, is released under the Apache 2.0 license, and is available at http://code.google.com/p/readdepth/.
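The general idea of inferring copy number from depth of coverage can be illustrated with a minimal Python sketch. It is not the readDepth implementation or its API: the function names (`bin_read_counts`, `estimate_copy_number`, `dispersion_index`), the fixed-width bins, the median baseline, and the toy data are all assumptions chosen for illustration. The sketch counts read start positions per bin, scales each bin by the median so that a typical bin reads as diploid, and reports the variance-to-mean ratio as a rough indicator of overdispersion (about 1 for Poisson-like counts, larger otherwise).

```python
from collections import Counter
from statistics import median


def bin_read_counts(read_starts, bin_size, chrom_length):
    """Count reads whose start position falls in each fixed-width bin."""
    n_bins = (chrom_length + bin_size - 1) // bin_size  # ceiling division
    counts = Counter(pos // bin_size for pos in read_starts)
    return [counts.get(i, 0) for i in range(n_bins)]


def estimate_copy_number(counts, ploidy=2):
    """Scale each bin by the median bin count so a typical bin equals `ploidy`.

    Assumption: most of the sequence is at normal copy number, so the median
    bin is a usable baseline without a separate reference sample.
    """
    baseline = median(counts)
    if baseline == 0:
        raise ValueError("median bin count is zero; use larger bins")
    return [ploidy * c / baseline for c in counts]


def dispersion_index(counts):
    """Sample variance divided by mean; values well above 1 suggest overdispersion."""
    mean = sum(counts) / len(counts)
    var = sum((c - mean) ** 2 for c in counts) / (len(counts) - 1)
    return var / mean


# Toy data (hypothetical): five 100 bp bins with 10 reads each,
# plus 10 extra reads in the third bin to mimic a copy number gain.
read_starts = [b * 100 + k for b in range(5) for k in range(0, 100, 10)]
read_starts += [200 + k for k in range(5, 100, 10)]

counts = bin_read_counts(read_starts, bin_size=100, chrom_length=500)
copy_number = estimate_copy_number(counts)
print(counts)       # [10, 10, 20, 10, 10]
print(copy_number)  # [2.0, 2.0, 4.0, 2.0, 2.0]
print(dispersion_index(counts))
```

A real analysis would also need mappability and GC-content corrections, a segmentation step to join adjacent bins, and a count model fitted to the observed dispersion; none of these are attempted here.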