Haplotype frequency estimation in patient populations: The effect of departures from Hardy‐Weinberg proportions and collapsing over a locus in the HLA region
- 10 January 2002
- journal article
- research article
- Published by Wiley in Genetic Epidemiology
- Vol. 22 (2) , 186-195
- https://doi.org/10.1002/gepi.0163
Abstract
Haplotype analyses are an important area in the study of the genetic components of human disease. Associations between markers and disease loci that are not evident with a single marker locus may be identified in multi‐locus marker analyses using estimated haplotype frequencies (HFs). Procedures that make use of the expectation‐maximization (EM) algorithm to estimate HFs from unphased genotype data are in common use in genetic studies. The EM algorithm uses these unphased genotype frequencies along with the assumption of Hardy‐Weinberg proportions (HWP) to converge on HF estimates. In this paper, we assess the accuracy of EM estimates of HFs in patients with type I diabetes for whom the true haplotypes are known, but the data are analyzed ignoring family information to allow comparison between estimated and true frequencies. The data consist of six HLA loci with high levels of polymorphism and a range of departures from HWP and linkage equilibrium. While the overall accuracy of the EM estimates is good, there can be large over‐ and underestimates of particular HFs, even for common haplotypes, especially when the loci involved deviate significantly from HWP. Estimating HFs for three or more loci and then collapsing over loci so as to generate two locus haplotypes can improve the accuracy of the estimation. The collapsing procedure is most beneficial when one of the loci in the two‐locus haplotype of interest deviates significantly from HWP and the locus collapsed over is in linkage disequilibrium with the other loci. Genet. Epidemiol. 22:186–195, 2002.Keywords
This publication has 14 references indexed in Scilit:
- Accuracy of Haplotype Frequency Estimation for Biallelic Loci, via the Expectation-Maximization Algorithm for Unphased Diploid Genotype DataAmerican Journal of Human Genetics, 2000
- The Accuracy of Statistical Methods for Estimation of Haplotype Frequencies: An Example from the CD4 LocusAmerican Journal of Human Genetics, 2000
- Strategies in complex disease mappingCurrent Opinion in Genetics & Development, 2000
- The HLA class II locus DPB1 can influence susceptibility to type 1 diabetes.Diabetes, 2000
- Association between type 1 diabetes age of onset and HLA among sibling pairs.Diabetes, 1999
- Evidence for linkage and association to alcohol dependence on chromosome 19Genetic Epidemiology, 1999
- Validation of haplotype frequency estimation methodsHuman Immunology, 1998
- Association Mapping of Disease Loci, by Use of a Pooled DNA Genomic ScreenAmerican Journal of Human Genetics, 1997
- Genetic analysis of type 1 diabetes using whole genome approaches.Proceedings of the National Academy of Sciences, 1995