Estimating effective population size and mutation rate from sequence data using Metropolis-Hastings sampling.
Open Access
- 1 August 1995
- journal article
- research article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 140 (4) , 1421-1430
- https://doi.org/10.1093/genetics/140.4.1421
Abstract
We present a new way to make a maximum likelihood estimate of the parameter 4N mu (effective population size times mutation rate per site, or theta) based on a population sample of molecular sequences. We use a Metropolis-Hastings Markov chain Monte Carlo method to sample genealogies in proportion to the product of their likelihood with respect to the data and their prior probability with respect to a coalescent distribution. A specific value of theta must be chosen to generate the coalescent distribution, but the resulting trees can be used to evaluate the likelihood at other values of theta, generating a likelihood curve. This procedure concentrates sampling on those genealogies that contribute most of the likelihood, allowing estimation of meaningful likelihood curves based on relatively small samples. The method can potentially be extended to cases involving varying population size, recombination, and migration.Keywords
This publication has 4 references indexed in Scilit:
- Estimating effective population size from samples of sequences: inefficiency of pairwise and segregating sites as compared to phylogenetic estimatesGenetics Research, 1992
- Extensive mitochondrial diversity within a single Amerindian tribe.Proceedings of the National Academy of Sciences, 1991
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980
- On the number of segregating sites in genetical models without recombinationTheoretical Population Biology, 1975