The impact of genotyping error on haplotype reconstruction and frequency estimation
- 2 October 2002
- journal article
- research article
- Published by Springer Nature in European Journal of Human Genetics
- Vol. 10 (10) , 616-622
- https://doi.org/10.1038/sj.ejhg.5200855
Abstract
The choice of genotyping families vs unrelated individuals is a critical factor in any large-scale linkage disequilibrium (LD) study. The use of unrelated individuals for such studies is promising, but in contrast to family designs, unrelated samples do not facilitate detection of genotyping errors, which have been shown to be of great importance for LD and linkage studies and may be even more important in genotyping collaborations across laboratories. Here we employ some of the most commonly-used analysis methods to examine the relative accuracy of haplotype estimation using families vs unrelateds in the presence of genotyping error. The results suggest that even slight amounts of genotyping error can significantly decrease haplotype frequency and reconstruction accuracy, that the ability to detect such errors in large families is essential when the number/complexity of haplotypes is high (low LD/common alleles). In contrast, in situations of low haplotype complexity (high LD and/or many rare alleles) unrelated individuals offer such a high degree of accuracy that there is little reason for less efficient family designs. Moreover, parent-child trios, which comprise the most popular family design and the most efficient in terms of the number of founder chromosomes per genotype but which contain little information for error detection, offer little or no gain over unrelated samples in nearly all cases, and thus do not seem a useful sampling compromise between unrelated individuals and large families. The implications of these results are discussed in the context of large-scale LD mapping projects such as the proposed genome-wide haplotype map.Keywords
This publication has 31 references indexed in Scilit:
- Merlin—rapid analysis of dense genetic maps using sparse gene flow treesNature Genetics, 2001
- A New Statistical Method for Haplotype Reconstruction from Population DataAmerican Journal of Human Genetics, 2001
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- The impact of genotyping error on family-based analysis of quantitative traitsEuropean Journal of Human Genetics, 2001
- Accuracy of Haplotype Frequency Estimation for Biallelic Loci, via the Expectation-Maximization Algorithm for Unphased Diploid Genotype DataAmerican Journal of Human Genetics, 2000
- The Accuracy of Statistical Methods for Estimation of Haplotype Frequencies: An Example from the CD4 LocusAmerican Journal of Human Genetics, 2000
- A Multipoint Method for Detecting Genotyping Errors and Mutations in Sibling-Pair Linkage DataAmerican Journal of Human Genetics, 2000
- True Pedigree Errors More Frequent Than Apparent Errors for Single Nucleotide PolymorphismsHuman Heredity, 1999
- Systematic detection of errors in genetic linkage dataGenomics, 1992
- Estimation of linkage disequilibrium in randomly mating populationsHeredity, 1979