Genotype and SNP calling from next-generation sequencing data
Top Cited Papers
- 18 May 2011
- journal article
- review article
- Published by Springer Nature in Nature Reviews Genetics
- Vol. 12 (6) , 443-451
- https://doi.org/10.1038/nrg2986
Abstract
Converting next-generation sequencing (NGS) image files into a set of called SNPs involves a number of steps including image analysis, alignment and assembly, SNP calling and genotype calling. Genotype probabilities for a single individual can be calculated from alignments using recalibrated quality scores. SNP calling and genotype calling is best done using information from multiple individuals simultaneously. The pattern of linkage disequilibrium should be used to call SNPs and genotypes when possible. Analyses of low coverage data can proceed by taking uncertainty in the genotype calls into account, rather than assuming any particular genotype call is correct. The methods used for calling SNPs and for taking uncertainty in SNP genotypes into account can have a strong effect on downstream analyses, including association mapping analyses.Keywords
This publication has 54 references indexed in Scilit:
- A framework for variation discovery and genotyping using next-generation DNA sequencing dataNature Genetics, 2011
- A map of human genome variation from population-scale sequencingNature, 2010
- Ab initio reconstruction of cell type–specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAsNature Biotechnology, 2010
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiationNature Biotechnology, 2010
- Intensity normalization improves color calling in SOLiD sequencingNature Methods, 2010
- Exome sequencing identifies the cause of a mendelian disorderNature Genetics, 2009
- The Relationship between Imputation Error and Statistical Power in Genetic Association Studies in Diverse PopulationsAmerican Journal of Human Genetics, 2009
- Population genomics of domestic and wild yeastsNature, 2009
- The diploid genome sequence of an Asian individualNature, 2008
- A new multipoint method for genome-wide association studies by imputation of genotypesNature Genetics, 2007