PolyScan: An automatic indel and SNP detection approach to the analysis of human resequencing data
Open Access
- 6 April 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (5), 659-666
- https://doi.org/10.1101/gr.6151507
Abstract
Small insertions and deletions (indels) and single nucleotide polymorphisms (SNPs) are common genetic variants that are thought to be associated with a wide variety of human diseases. Owing to the genome’s size and complexity, manually characterizing each one of these variations in an individual is not practical. While significant progress has been made in automated single-base mutation discovery from the sequences of diploid PCR products, automated and reliable detection of indels continues to pose difficult challenges. In this paper, we present PolyScan, an algorithm and software implementation designed to provide de novo heterozygous indel detection and improved SNP identification in the context of high-throughput medical resequencing. Tests on a human diploid PCR-based sequence data set, consisting of 90,270 traces from 13 genes, indicate that PolyScan identified ∼90% of the 151 consensus indel sites and ∼84% of the 1546 heterozygous indels previously identified by manual inspection. Tests on tumor-derived data show that PolyScan better identifies high-quality, low-level mutations as compared with other mutation detection software. Moreover, SNP identification improves when reprocessing the results of other programs. These results suggest that PolyScan may play a useful role in the post human genome project research era.
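To put the reported sensitivities in absolute terms, the short sketch below converts the two approximate percentages from the abstract into rough counts. The totals (151 and 1546) and rates (about 90% and about 84%) come from the abstract; the function name `approx_detected` and the rounding step are illustrative assumptions, so the resulting counts are estimates, not figures reported by the authors.

```python
# Totals and approximate detection rates as stated in the abstract.
# Both rates are given with "~", so derived counts are estimates only.
REPORTED = {
    "consensus indel sites": (151, 0.90),
    "heterozygous indels": (1546, 0.84),
}


def approx_detected(total: int, rate: float) -> int:
    """Return the rounded count implied by an approximate detection rate.

    Hypothetical helper for illustration; not part of PolyScan.
    """
    return round(total * rate)


for label, (total, rate) in REPORTED.items():
    found = approx_detected(total, rate)
    missed = total - found
    print(f"{label}: ~{found} of {total} detected, ~{missed} missed")
```

Under these assumptions, the stated rates correspond to roughly 136 of the 151 consensus indel sites and roughly 1299 of the 1546 heterozygous indels.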