A geometric approach for classification and comparison of structural variants
Open Access
- 27 May 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (12) , i222-i230
- https://doi.org/10.1093/bioinformatics/btp208
Abstract
Motivation: Structural variants, including duplications, insertions, deletions and inversions of large blocks of DNA sequence, are an important contributor to human genome variation. Measuring structural variants in a genome sequence is typically more challenging than measuring single nucleotide changes. Current approaches for structural variant identification, including paired-end DNA sequencing/mapping and array comparative genomic hybridization (aCGH), do not identify the boundaries of variants precisely. Consequently, most reported human structural variants are poorly defined and not readily compared across different studies and measurement techniques.

Results: We introduce Geometric Analysis of Structural Variants (GASV), a geometric approach for identification, classification and comparison of structural variants. This approach represents the uncertainty in measurement of a structural variant as a polygon in the plane, and identifies measurements supporting the same variant by computing intersections of polygons. We derive a computational geometry algorithm to efficiently identify all such intersections. We apply GASV to sequencing data from nine individual human genomes and several cancer genomes. We obtain better localization of the boundaries of structural variants, distinguish genetic from putative somatic structural variants in cancer genomes, and integrate aCGH and paired-end sequencing measurements of structural variants. This work presents the first general framework for comparing structural variants across multiple samples and measurement techniques, and will be useful for studies of both genetic structural variants and somatic rearrangements in cancer.

Availability: http://cs.brown.edu/people/braphael/software.html

Contact: braphael@brown.edu
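The polygon idea in the Results can be illustrated with a small sketch. It is not the GASV implementation, and the abstract does not specify how polygons are constructed, so the following are assumptions made only for illustration: a deletion-like paired read with mapped ends x < y and fragment length bounds lmin and lmax constrains the breakpoint pair (a, b) by a >= x, b <= y and lmin <= (a - x) + (y - b) <= lmax, which gives a trapezoid in the (a, b) plane. Two such trapezoids are intersected with plain Sutherland-Hodgman convex clipping, a simple pairwise stand-in rather than the efficient algorithm the paper derives. All function names are hypothetical.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def breakpoint_trapezoid(x: float, y: float, lmin: float, lmax: float) -> List[Point]:
    """Counter-clockwise trapezoid of candidate breakpoints (a, b).

    Assumed constraints (illustrative, not taken from the abstract):
    a >= x, b <= y, lmin <= (a - x) + (y - b) <= lmax.
    """
    return [(x, y - lmax), (x + lmax, y), (x + lmin, y), (x, y - lmin)]


def _cross(a: Point, b: Point, p: Point) -> float:
    # Positive when p lies to the left of the directed line a -> b.
    return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0])


def _intersect(a: Point, b: Point, p: Point, q: Point) -> Point:
    # Point where segment p-q crosses the line through a and b.
    # Only called when p and q lie on opposite sides, so dp != dq.
    dp, dq = _cross(a, b, p), _cross(a, b, q)
    t = dp / (dp - dq)
    return (p[0] + t * (q[0] - p[0]), p[1] + t * (q[1] - p[1]))


def clip_convex(subject: List[Point], clip: List[Point]) -> List[Point]:
    """Sutherland-Hodgman: intersect `subject` with a convex, CCW `clip` polygon."""
    output = list(subject)
    for i in range(len(clip)):
        if not output:
            break
        a, b = clip[i], clip[(i + 1) % len(clip)]
        current, output = output, []
        for j in range(len(current)):
            p, q = current[j - 1], current[j]
            p_in, q_in = _cross(a, b, p) >= 0, _cross(a, b, q) >= 0
            if q_in:
                if not p_in:
                    output.append(_intersect(a, b, p, q))
                output.append(q)
            elif p_in:
                output.append(_intersect(a, b, p, q))
    return output


def area(polygon: List[Point]) -> float:
    """Unsigned polygon area by the shoelace formula (0.0 for an empty polygon)."""
    total = 0.0
    for i in range(len(polygon)):
        (x1, y1), (x2, y2) = polygon[i], polygon[(i + 1) % len(polygon)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


def support_same_variant(p1: List[Point], p2: List[Point]) -> bool:
    # Two measurements are treated as compatible when their polygons overlap
    # in a region of positive area (boundary-only contact is ignored here).
    return area(clip_convex(p1, p2)) > 0.0


# Hypothetical reads: the coordinates and length bounds are made up.
read1 = breakpoint_trapezoid(x=1000, y=5000, lmin=200, lmax=400)
read2 = breakpoint_trapezoid(x=1050, y=5100, lmin=200, lmax=400)
read3 = breakpoint_trapezoid(x=20000, y=30000, lmin=200, lmax=400)

overlap = clip_convex(read1, read2)
print(support_same_variant(read1, read2), area(overlap))
print(support_same_variant(read1, read3))
```

Under these assumptions, the overlap polygon is smaller than either input trapezoid, which is one way to picture how combining measurements narrows the region where the breakpoints can lie. Checking every pair this way costs quadratic time in the number of measurements, which is the kind of cost an efficient intersection algorithm is meant to avoid.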