A Measure of Phase Ambiguity in Pairs of SNPs in the Presence ofLinkage Disequi librium
- 1 July 2000
- journal article
- Published by S. Karger AG in Human Heredity
- Vol. 50 (6) , 359-364
- https://doi.org/10.1159/000022941
Abstract
The extent of haplotype ambiguity in a string of single-nucleotide polymorphisms (SNPs) was quantified by Hodge et al. [Nat Genet 1999;21:360]. In their measure, the level of ambiguity increases with increasing numbers of loci and as loci become more polymorphic. That work assumed linkage equilibrium (LE). However, linkage disequilibrium (LD) provides additional information about the haplotypes at a site, thereby diluting the level of ambiguity. The ambiguity vanishes altogether when LD reaches its maximum value. Here, we introduce the ambiguity measure, Φ, to allow for LD (between pairs of SNPs). We derive the formula Φ = 4x2x3 for ambiguity in individuals, where x1, x2, x3 and x4 are the probabilites of the A1A2, A1B2, B1A2 and B1B2 haplotypes, respectively, and w.l.o.g. x1x4 ≥ x2x3. Alternatively, Φ can be expressed in terms of the allele frequencies and the LD parameter δ. We also extend the formula to triads of two parents plus one child. We estimate our measure Φ for relevant SNPs in the published lipoprotein lipase (LPL) gene dataset [Clark et al., Am J Hum Genet 1998;63:595; Nickerson et al., Nat Genet 1998;19:233], obtaining values ranging from a low of 0 to a high of 0.11 among adjacent pairs of sites. In genome-wide LD studies to map common disease genes, a dense map of SNPs may be utilized to detect association between a marker and disease. Therefore, the measurement of ambiguity can potentially help investigators to determine a more efficient map, designed to minimize ambiguity and subsequent information loss.Keywords
This publication has 6 references indexed in Scilit:
- Genetic variance components analysis for binary phenotypes using generalized linear mixed models (GLMMs) and Gibbs samplingGenetic Epidemiology, 1999
- Loss of information due to ambiguous haplotyping of SNPsNature Genetics, 1999
- True Pedigree Errors More Frequent Than Apparent Errors for Single Nucleotide PolymorphismsHuman Heredity, 1999
- Haplotype Structure and Population Genetic Inferences from Nucleotide-Sequence Variation in Human Lipoprotein LipaseAmerican Journal of Human Genetics, 1998
- DNA sequence diversity in a 9.7-kb region of the human lipoprotein lipase geneNature Genetics, 1998
- Bayes estimates for immunological progression rates in HIV diseaseStatistics in Medicine, 1997