A robust framework for detecting structural variations in a genome
Open Access
- 1 July 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (13) , i59-i67
- https://doi.org/10.1093/bioinformatics/btn176
Abstract
Motivation: Recently, structural genomic variants have come to the forefront as a significant source of variation in the human population, but the identification of these variants in a large genome remains a challenge. The complete sequencing of a human individual is prohibitive at current costs, while current polymorphism detection technologies, such as SNP arrays, are not able to identify many of the large scale events. One of the most promising methods to detect such variants is the computational mapping of clone-end sequences to a reference genome. Results: Here, we present a probabilistic framework for the identification of structural variants using clone-end sequencing. Unlike previous methods, our approach does not rely on an a priori determined mapping of all reads to the reference. Instead, we build a framework for finding the most probable assignment of sequenced clones to potential structural variants based on the other clones. We compare our predictions with the structural variants identified in three previous studies. While there is a statistically significant correlation between the predictions, we also find a significant number of previously uncharacterized structural variants. Furthermore, we identify a number of putative cross-chromosomal events, primarily located proximally to the centromeres of the chromosomes. Availability: Our dataset, results and source code are available at http://compbio.cs.toronto.edu/structvar/ Contact: seunghak@cs.toronto.edu,echeran@cs.toronto.edu,brudno@cs.toronto.eduKeywords
This publication has 20 references indexed in Scilit:
- Paired-End Mapping Reveals Extensive Structural Variation in the Human GenomeScience, 2007
- The Diploid Genome Sequence of an Individual HumanPLoS Biology, 2007
- An initial map of insertion and deletion (INDEL) variation in the human genomeGenome Research, 2006
- Structural variation in the human genomeNature Reviews Genetics, 2006
- Fine-scale structural variation of the human genomeNature Genetics, 2005
- The International HapMap ProjectNature, 2003
- The Human Genome Browser at UCSCGenome Research, 2002
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- A Method for Comparing Two Hierarchical ClusteringsJournal of the American Statistical Association, 1983