Haplotype and Missing Data Inference in Nuclear Families
Open Access
- 15 July 2004
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (8), 1624-1632
- https://doi.org/10.1101/gr.2204604
Abstract
Determining linkage phase from population samples with statistical methods is accurate only within regions of high linkage disequilibrium (LD). Yet, affected individuals in a genetic mapping study, including those involving cases and controls, may share sequences identical-by-descent stretching on the order of tens to hundreds of kilobases, quite possibly over regions of low LD in the population. At the same time, inferring phase from nuclear families may be hampered by missing family members, missing genotypes, and the noninformativity of certain genotype patterns. In this study, we reformulate our previous haplotype reconstruction algorithm, and its associated computer program, to phase parents with information derived from population samples as well as from their offspring. In applications of our algorithm to 100-kb stretches, simulated in accordance with a Wright-Fisher model with typical levels of LD in humans, we find that phase reconstruction for 160 trios with 10% missing data is highly accurate (>90%) over the entire length. Furthermore, our algorithm can estimate allelic status for missing data at high accuracy (>95%). Finally, the input capacity of the program is vast, easily handling thousands of segregating sites in ≥1000 chromosomes.
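The "noninformativity of certain genotype patterns" mentioned in the abstract can be illustrated with a minimal sketch of Mendelian transmission in a single trio. This is not the authors' algorithm or program; it applies only the basic transmission rule at biallelic sites, and the function names, the 0/1/2 genotype coding, and the use of `None` for missing data are assumptions made for illustration. Sites left as `None` in the output are the ones that family data alone cannot resolve, which is where, per the abstract, information from population samples is brought in.

```python
from typing import List, Optional, Tuple

# Assumed coding: genotype = number of copies of allele "1" (0, 1, or 2);
# None marks a missing genotype. Input is assumed to be Mendel-consistent.
Genotype = Optional[int]
Allele = Optional[int]


def transmitted_alleles(father: Genotype, mother: Genotype,
                        child: Genotype) -> Tuple[Allele, Allele]:
    """Return (paternal, maternal) alleles passed to the child at one site.

    An entry is None when the trio genotypes do not determine it.
    """
    def from_homozygote(g: Genotype) -> Allele:
        # A homozygous parent can transmit only one allele.
        return g // 2 if g in (0, 2) else None

    # A homozygous child received the same allele from both parents.
    if child in (0, 2):
        return child // 2, child // 2

    pat = from_homozygote(father)
    mat = from_homozygote(mother)

    # A heterozygous child received one copy of each allele, so knowing
    # one parent's contribution fixes the other's.
    if child == 1:
        if pat is not None and mat is None:
            mat = 1 - pat
        elif mat is not None and pat is None:
            pat = 1 - mat
    return pat, mat


def phase_trio(father: List[Genotype], mother: List[Genotype],
               child: List[Genotype]) -> Tuple[List[Allele], List[Allele]]:
    """Build the transmitted paternal and maternal haplotypes site by site."""
    pat_hap: List[Allele] = []
    mat_hap: List[Allele] = []
    for f, m, c in zip(father, mother, child):
        p, q = transmitted_alleles(f, m, c)
        pat_hap.append(p)
        mat_hap.append(q)
    return pat_hap, mat_hap


# Hypothetical four-site trio: site 1 is heterozygous in all three members
# (noninformative); the mother's genotype is missing at site 3.
father = [0, 1, 1, 2]
mother = [1, 1, 2, None]
child = [1, 1, 2, 1]
pat_hap, mat_hap = phase_trio(father, mother, child)
```

In this hypothetical trio, the site where father, mother, and child are all heterozygous stays unresolved, while the site with the missing maternal genotype is still phased because the father is homozygous there.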
This publication has 39 references indexed in Scilit:
- A Comparison of Bayesian Methods for Haplotype Reconstruction from Population Genotype Data. American Journal of Human Genetics, 2003
- Efficiency of Haplotype Frequency Estimation when Nuclear Familiy Information Is Included. Human Heredity, 2002
- Haplotype Inference in Random Population Samples. American Journal of Human Genetics, 2002
- On the advantage of haplotype analysis in the presence of multiple disease susceptibility alleles. Genetic Epidemiology, 2002
- Caution on Pedigree Haplotype Inference with Software That Assumes Linkage Equilibrium. American Journal of Human Genetics, 2002
- Genetic Variation Analysis of Neuropsychiatric Traits. Published by Taylor & Francis, 2001
- Are Rare Variants Responsible for Susceptibility to Complex Diseases? American Journal of Human Genetics, 2001
- Haplotyping and estimation of haplotype frequencies for closely linked biallelic multilocus genetic phenotypes including nuclear family information. Human Mutation, 2001
- A New Statistical Method for Haplotype Reconstruction from Population Data. American Journal of Human Genetics, 2001
- Effect of allelic heterogeneity on the power of the transmission disequilibrium test. Genetic Epidemiology, 2000