Gametic phase estimation over large genomic regions using an adaptive window approach
Open Access
- 1 January 2003
- journal article
- research article
- Published by Springer Nature in Human Genomics
- Vol. 1 (1) , 7-19
- https://doi.org/10.1186/1479-7364-1-1-7
Abstract
The authors present ELB, an easy to programme and computationally fast algorithm for inferring gametic phase in population samples of multilocus genotypes. Phase updates are made on the basis of a window of neighbouring loci, and the window size varies according to the local level of linkage disequilibrium. Thus, ELB is particularly well suited to problems involving many loci and/or relatively large genomic regions, including those with variable recombination rate. The authors have simulated population samples of single nucleotide polymorphism genotypes with varying levels of recombination and marker density, and find that ELB provides better local estimation of gametic phase than the PHASE or HTYPER programs, while its global accuracy is broadly similar. The relative improvement in local accuracy increases both with increasing recombination and with increasing marker density. Short tandem repeat (STR, or microsatellite) simulation studies demonstrate ELB's superiority over PHASE both globally and locally. Missing data are handled by ELB; simulations show that phase recovery is virtually unaffected by up to 2 per cent of missing data, but that phase estimation is noticeably impaired beyond this amount. The authors also applied ELB to datasets obtained from random pairings of 42 human X chromosomes typed at 97 diallelic markers in a 200 kb low-recombination region. Once again, they found ELB to have consistently better local accuracy than PHASE or HTYPER, while its global accuracy was close to the best.</pKeywords
This publication has 34 references indexed in Scilit:
- Haplotype Inference in Random Population SamplesAmerican Journal of Human Genetics, 2002
- Assessing Population Differentiation and Isolation from Single-Nucleotide Polymorphism DataJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- The Structure of Haplotype Blocks in the Human GenomeScience, 2002
- Patterns of linkage disequilibrium in the human genomeNature Reviews Genetics, 2002
- Comparisons of Two Methods for Haplotype Reconstruction and Haplotype Frequency Estimation from Population DataAmerican Journal of Human Genetics, 2001
- Haplotyping and estimation of haplotype frequencies for closely linked biallelic multilocus genetic phenotypes including nuclear family informationHuman Mutation, 2001
- Homogeneous Assays for Single-Nucleotide Polymorphism Typing Using AlphaScreenGenome Research, 2001
- Accuracy of Haplotype Frequency Estimation for Biallelic Loci, via the Expectation-Maximization Algorithm for Unphased Diploid Genotype DataAmerican Journal of Human Genetics, 2000
- The Accuracy of Statistical Methods for Estimation of Haplotype Frequencies: An Example from the CD4 LocusAmerican Journal of Human Genetics, 2000
- DNA Variation in a 5-Mb Region of the X Chromosome and Estimates of Sex-Specific/Type-Specific Mutation RatesAmerican Journal of Human Genetics, 1999