Bayesian Variable Selection for Detecting Adaptive Genomic Differences Among Populations
- 1 March 2008
- journal article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 178 (3), 1817-1829
- https://doi.org/10.1534/genetics.107.081281
Abstract
We extend an Fst-based Bayesian hierarchical model, implemented via Markov chain Monte Carlo, for the detection of loci that might be subject to positive selection. This model divides the Fst-influencing factors into locus-specific effects, population-specific effects, and effects that are specific for the locus in combination with the population. We introduce a Bayesian auxiliary variable for each locus effect to automatically select nonneutral locus effects. As a by-product, the efficiency of the original approach is improved by using a reparameterization of the model. The statistical power of the extended algorithm is assessed with simulated data sets from a Wright–Fisher model with migration. We find that the inclusion of model selection suggests a clear improvement in discrimination as measured by the area under the receiver operating characteristic (ROC) curve. Additionally, we illustrate and discuss the quality of the newly developed method on the basis of an allozyme data set of the fruit fly Drosophila melanogaster and a sequence data set of the wild tomato Solanum chilense. For data sets with small sample sizes, high mutation rates, and/or long sequences, however, methods based on nucleotide statistics should be preferred.
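The two ideas in the abstract that lend themselves to a small illustration are the additive decomposition of Fst with a per-locus on/off indicator, and the use of ROC area as a measure of discrimination. The sketch below is not the authors' implementation. It assumes a logistic link between the summed effects and Fst (the abstract does not state the link), and the function names `fst_from_effects` and `roc_auc`, as well as all numeric values, are hypothetical and chosen only for illustration.

```python
import math


def fst_from_effects(alpha, beta, gamma, indicator):
    """Map additive effects to a value in (0, 1).

    ASSUMPTION: a logistic link; the abstract only says that Fst is
    decomposed into a locus effect (alpha), a population effect (beta),
    and a locus-by-population effect (gamma).
    indicator is 0 or 1 and switches the locus effect off or on,
    mimicking the role of an auxiliary selection variable.
    """
    eta = indicator * alpha + beta + gamma
    return 1.0 / (1.0 + math.exp(-eta))


def roc_auc(scores, labels):
    """Area under the ROC curve by pairwise comparison.

    Equals the probability that a randomly chosen selected locus
    (label 1) receives a higher score than a randomly chosen neutral
    locus (label 0), with ties counted as one half.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one locus of each class")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Made-up per-locus scores (for example, posterior probabilities that the
# indicator equals 1) and made-up true labels from a simulation.
scores = [0.9, 0.8, 0.3, 0.2, 0.1]
labels = [1, 0, 1, 0, 0]
auc = roc_auc(scores, labels)
```

With the indicator set to 0, the locus effect drops out and only the population and interaction terms determine the value, which is the sense in which a locus is treated as neutral in this sketch. The pairwise AUC is quadratic in the number of loci, which is acceptable for illustration but would be replaced by a rank-based computation for large scans.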