Recombination Estimation Under Complex Evolutionary Models with the Coalescent Composite-Likelihood Method
- 1 February 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 23 (4) , 817-827
- https://doi.org/10.1093/molbev/msj102
Abstract
The composite-likelihood estimator (CLE) of the population recombination rate considers only sites with exactly two alleles under a finite-sites mutation model (McVean, G. A. T., P. Awadalla, and P. Fearnhead. 2002. A coalescent-based method for detecting and estimating recombination from gene sequences. Genetics 160:1231–1241). While in such a model the identity of alleles is not considered, the CLE has been shown to be robust to minor misspecification of the underlying mutational model. However, there are many situations where the putative mutation and demographic history can be quite complex. One good example is rapidly evolving pathogens, like HIV-1. First we evaluated the performance of the CLE and the likelihood permutation test (LPT) under more complex, realistic models, including a general time reversible (GTR) substitution model, rate heterogeneity among sites (Γ), positive selection, population growth, population structure, and noncontemporaneous sampling. Second, we relaxed some of the assumptions of the CLE allowing for a four-allele, GTR+Γ model in an attempt to use the data more efficiently. Through simulations and the analysis of real data, we concluded that the CLE is robust to severe misspecifications of the substitution model, but underestimates the recombination rate in the presence of exponential growth, population mixture, selection, or noncontemporaneous sampling. In such cases, the use of more complex models slightly increases performance in some occasions, especially in the case of the LPT. Thus, our results provide for a more robust application of the estimation of recombination rates.Keywords
This publication has 37 references indexed in Scilit:
- Estimating recombination rates from population-genetic dataNature Reviews Genetics, 2003
- The evolutionary genomics of pathogen recombinationNature Reviews Genetics, 2003
- Recombination in Evolutionary GenomicsAnnual Review of Genetics, 2002
- Immune-Mediated Positive Selection Drives Human Immunodeficiency Virus Type 1 Molecular Variation and Predicts Disease DurationJournal of Virology, 2002
- Linkage disequilibrium and the mapping of complex human traitsTrends in Genetics, 2002
- Estimating Mutation Rate and Generation Time from Longitudinal Samples of DNA SequencesMolecular Biology and Evolution, 2001
- Parallel evolution of drug resistance in HIV: failure of nonsynonymous/synonymous substitution rate ratio to detect selectionMolecular Biology and Evolution, 1999
- Genealogies and weak purifying selection.Molecular Biology and Evolution, 1999
- Maximum-likelihood estimation of phylogeny from DNA sequences when substitution rates differ over sites.Molecular Biology and Evolution, 1993
- On the number of segregating sites in genetical models without recombinationTheoretical Population Biology, 1975