The evolutionary forest algorithm
Open Access
- 22 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (15), 1962-1968
- https://doi.org/10.1093/bioinformatics/btm264
Abstract
Motivation: Gene genealogies offer a powerful context for inferences about the evolutionary process based on presently segregating DNA variation. In many cases, it is the distribution of population parameters, marginalized over the effectively infinite-dimensional tree space, that is of interest. Our evolutionary forest (EF) algorithm uses Monte Carlo methods to generate posterior distributions of population parameters. A novel feature is the updating of parameter values based on a probability measure defined on an ensemble of histories (a forest of genealogies), rather than a single tree.
Results: The EF algorithm generates samples from the correct marginal distribution of population parameters. Applied to actual data from closely related fruit fly species, it rapidly converged to posterior distributions that closely approximated the exact posteriors generated through massive computational effort. Applied to simulated data, it generated credible intervals that covered the actual parameter values in accordance with the nominal probabilities.
Availability: A C++ implementation of this method is freely accessible at http://www.isds.duke.edu/~scl13
Contact: scotland@stat.duke.edu
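The statement that credible intervals cover the actual parameter values "in accordance with the nominal probabilities" describes a calibration check. The sketch below illustrates that kind of check on a toy conjugate normal model; it is not the EF algorithm and does not involve genealogies. The model (a N(0, 1) prior on a parameter `theta` with N(theta, 1) observations), the function names `credible_interval` and `empirical_coverage`, and all default settings are assumptions chosen for illustration only.

```python
import random
from statistics import NormalDist


def credible_interval(data, level):
    """Equal-tailed credible interval for theta in a toy model (assumed here):
    prior theta ~ N(0, 1), observations x_i ~ N(theta, 1)."""
    n = len(data)
    # Conjugate update: posterior precision is n + 1, posterior mean is sum(x) / (n + 1).
    post_mean = sum(data) / (n + 1)
    post_sd = (1.0 / (n + 1)) ** 0.5
    z = NormalDist().inv_cdf(0.5 + level / 2.0)
    return post_mean - z * post_sd, post_mean + z * post_sd


def empirical_coverage(level, n_obs=10, n_reps=2000, seed=0):
    """Fraction of simulated data sets whose interval contains the true theta."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_reps):
        theta = rng.gauss(0.0, 1.0)  # true value drawn from the prior
        data = [rng.gauss(theta, 1.0) for _ in range(n_obs)]
        lo, hi = credible_interval(data, level)
        hits += lo <= theta <= hi
    return hits / n_reps


if __name__ == "__main__":
    # Empirical coverage should sit close to each nominal level.
    for level in (0.5, 0.9, 0.95):
        print(level, empirical_coverage(level))
```

When the sampler targets the correct posterior, the empirical coverage should match each nominal level up to Monte Carlo error; a systematic shortfall would point to a posterior that is too narrow or misplaced.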