A genotype calling algorithm for the Illumina BeadArray platform
Open Access
- 10 September 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (20) , 2741-2746
- https://doi.org/10.1093/bioinformatics/btm443
Abstract
Motivation: Large-scale genotyping relies on the use of unsupervised automated calling algorithms to assign genotypes to hybridization data. A number of such calling algorithms have been recently established for the Affymetrix GeneChip genotyping technology. Here, we present a fast and accurate genotype calling algorithm for the Illumina BeadArray genotyping platforms. As the technology moves towards assaying millions of genetic polymorphisms simultaneously, there is a need for an integrated and easy-to-use software for calling genotypes. Results: We have introduced a model-based genotype calling algorithm which does not rely on having prior training data or require computationally intensive procedures. The algorithm can assign genotypes to hybridization data from thousands of individuals simultaneously and pools information across multiple individuals to improve the calling. The method can accommodate variations in hybridization intensities which result in dramatic shifts of the position of the genotype clouds by identifying the optimal coordinates to initialize the algorithm. By incorporating the process of perturbation analysis, we can obtain a quality metric measuring the stability of the assigned genotype calls. We show that this quality metric can be used to identify SNPs with low call rates and accuracy. Availability: The C++ executable for the algorithm described here is available by request from the authors. Contact:teo@well.ox.ac.uk or tgc@well.ox.ac.ukKeywords
This publication has 15 references indexed in Scilit:
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007
- Genome-Wide Association Analysis Identifies Loci for Type 2 Diabetes and Triglyceride LevelsScience, 2007
- A Genome-Wide Association Study of Type 2 Diabetes in Finns Detects Multiple Susceptibility VariantsScience, 2007
- A Method to Address Differential Bias in Genotyping in Large-Scale Association StudiesPLoS Genetics, 2007
- Genome-wide association study identifies new susceptibility loci for Crohn disease and implicates autophagy in disease pathogenesisNature Genetics, 2007
- Genome-wide association study of prostate cancer identifies a second risk locus at 8q24Nature Genetics, 2007
- Optimal genotype determination in highly multiplexed SNP dataEuropean Journal of Human Genetics, 2005
- A genotype calling algorithm for affymetrix SNP arraysBioinformatics, 2005
- Dynamic model based algorithms for screening and genotyping over 100K SNPs on oligonucleotide microarraysBioinformatics, 2005
- A comparison of normalization methods for high density oligonucleotide array data based on variance and biasBioinformatics, 2003