Penalized estimation of haplotype frequencies
Open Access
- 16 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (14) , 1596-1602
- https://doi.org/10.1093/bioinformatics/btn236
Abstract
Motivation: Low haplotype diversity and linkage disequilibrium are the rule in short genomic segments. This fact suggests that parsimony should be enforced in estimation of haplotype frequencies. The current article introduces a diversity penalty that automatically discards potential haplotypes with low explanatory power. The standard EM algorithm for haplotype frequency estimation can accommodate the penalty if one passes over to a more general minorize–maximize (MM) scheme for estimation.

Results: Our new MM algorithm converges in fewer iterations, eliminates marginal haplotypes from further consideration and reduces the computational complexity of each iteration. Estimation by the MM algorithm also improves haplotyping and genotype imputation compared to naive application of the EM algorithm. Thus, the MM algorithm is a useful substitute for the EM algorithm. Compared to the most sophisticated current methods of haplotyping and genotype imputation, the MM algorithm is slightly less accurate but at least an order of magnitude faster.

Availability: Our software will be made available in the next release of the program Mendel at http://www.genetics.ucla.edu/software/.

Contact: kayers@ucla.edu
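For orientation, the standard EM algorithm that the abstract takes as its baseline can be sketched as follows. This is a minimal illustration only, not the authors' MM algorithm: it does not implement the diversity penalty, whose form is not given in the abstract. The function names, the 0/1/2 genotype coding, and the uniform starting frequencies are all assumptions made for this example.

```python
from collections import defaultdict
from itertools import product


def compatible_pairs(genotype):
    """List the unordered haplotype pairs consistent with an unphased genotype.

    genotype: tuple of 0/1/2, the number of copies of allele 1 at each SNP
    (an assumed coding, chosen for this sketch).
    """
    het = [i for i, g in enumerate(genotype) if g == 1]
    base = [1 if g == 2 else 0 for g in genotype]
    if not het:
        h = tuple(base)
        return [(h, h)]
    pairs = []
    # Fix the first heterozygous site so each unordered pair is listed once.
    for bits in product((0, 1), repeat=len(het) - 1):
        h1, h2 = base[:], base[:]
        for site, b in zip(het, (0,) + bits):
            h1[site], h2[site] = b, 1 - b
        pairs.append((tuple(h1), tuple(h2)))
    return pairs


def em_haplotype_frequencies(genotypes, iterations=100):
    """Plain (unpenalized) EM estimate of haplotype frequencies."""
    pair_lists = [compatible_pairs(g) for g in genotypes]
    haplotypes = {h for pairs in pair_lists for pair in pairs for h in pair}
    freq = {h: 1.0 / len(haplotypes) for h in haplotypes}
    for _ in range(iterations):
        counts = defaultdict(float)
        # E-step: split each person's two haplotype copies across the
        # compatible pairs in proportion to the current pair probabilities.
        for pairs in pair_lists:
            weights = [freq[h1] * freq[h2] for h1, h2 in pairs]
            total = sum(weights)
            for (h1, h2), w in zip(pairs, weights):
                counts[h1] += w / total
                counts[h2] += w / total
        # M-step: expected haplotype counts divided by 2n chromosomes.
        freq = {h: counts[h] / (2 * len(genotypes)) for h in haplotypes}
    return freq


# Two homozygotes plus one double heterozygote at two SNPs.
genotypes = [(0, 0), (2, 2), (1, 1)]
freq = em_haplotype_frequencies(genotypes)
for hap, f in sorted(freq.items()):
    print(hap, round(f, 4))
```

In this toy data set the double heterozygote is resolved in favor of the two haplotypes already seen in the homozygotes, and the other two candidate haplotypes shrink toward zero frequency without ever being dropped from the computation. That lingering cost is the kind of overhead the abstract says the penalized MM scheme avoids by eliminating marginal haplotypes.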