On the inference of ancestries in admixed populations
- 18 March 2008
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 18 (4), 668-675
- https://doi.org/10.1101/gr.072751.107
Abstract
Inference of ancestral information in recently admixed populations, in which every individual is composed of a mixed ancestry (e.g., African Americans in the United States), is a challenging problem. Several previous model-based approaches to admixture have been based on hidden Markov models (HMMs) and Markov hidden Markov models (MHMMs). We present an augmented form of these models that can be used to predict historical recombination events and can model background linkage disequilibrium (LD) more accurately. We also study some of the computational issues that arise in using such Markovian models on realistic data sets. In particular, we present an effective initialization procedure that, when combined with expectation-maximization (EM) algorithms for parameter estimation, yields high accuracy at significantly decreased computational cost relative to the Markov chain Monte Carlo (MCMC) algorithms that have generally been used in earlier studies. We present experiments exploring these modeling and algorithmic issues in two scenarios: the inference of locus-specific ancestries in a population that is assumed to originate from two unknown ancestral populations, and the inference of allele frequencies in one ancestral population given those in another.
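As a rough illustration of the kind of HMM the abstract refers to, the sketch below computes posterior locus-specific ancestries along one haplotype with the standard forward-backward recursions. It is a plain two-state HMM, not the augmented model of the paper: the function name `posterior_ancestry`, the single symmetric `switch_prob` between adjacent markers, and the assumption that ancestral allele frequencies are known and lie strictly between 0 and 1 are all simplifications made here for illustration.

```python
def posterior_ancestry(alleles, freqs, switch_prob=0.05, prior=(0.5, 0.5)):
    """Posterior P(ancestry k at locus j | all alleles) for a toy two-state HMM.

    alleles: list of 0/1 alleles along one haplotype.
    freqs:   list of (p0, p1) pairs, the frequency of allele 1 in ancestral
             population 0 and 1 (assumed known, strictly between 0 and 1).
    switch_prob: hypothetical per-interval probability of an ancestry switch.
    """
    n = len(alleles)
    if n == 0:
        return []

    def emit(j, k):
        # Probability of the observed allele at locus j under ancestry k.
        p = freqs[j][k]
        return p if alleles[j] == 1 else 1.0 - p

    def trans(a, b):
        # Symmetric transition: keep the ancestry or switch to the other one.
        return 1.0 - switch_prob if a == b else switch_prob

    def normalize(v):
        s = sum(v)
        return [x / s for x in v]

    # Forward pass, rescaled at every locus to avoid numerical underflow.
    fwd = [normalize([prior[k] * emit(0, k) for k in (0, 1)])]
    for j in range(1, n):
        prev = fwd[-1]
        fwd.append(normalize([
            emit(j, k) * sum(prev[a] * trans(a, k) for a in (0, 1))
            for k in (0, 1)
        ]))

    # Backward pass, also rescaled at every locus.
    bwd = [[1.0, 1.0] for _ in range(n)]
    for j in range(n - 2, -1, -1):
        nxt = bwd[j + 1]
        bwd[j] = normalize([
            sum(trans(k, b) * emit(j + 1, b) * nxt[b] for b in (0, 1))
            for k in (0, 1)
        ])

    # Combine both passes into per-locus posterior ancestry probabilities.
    return [normalize([f[k] * b[k] for k in (0, 1)]) for f, b in zip(fwd, bwd)]
```

Calling `posterior_ancestry([1, 1, 1, 0, 0, 0], [(0.9, 0.1)] * 6)` returns one pair of probabilities per locus. In an EM setting such as the one the abstract mentions, posteriors of this kind would form the E-step, with the allele frequencies and switch rate re-estimated in the M-step; that loop is not shown here.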