Evaluation of methods for detecting recombination from DNA sequences: Computer simulations
- 20 November 2001
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 98 (24), 13757-13762
- https://doi.org/10.1073/pnas.241370698
Abstract
Recombination is a key evolutionary process that shapes the architecture of genomes and the genetic structure of populations. Although many statistical methods are available for the detection of recombination from DNA sequences, their absolute and relative performance is still unknown. Here we evaluated the performance of 14 different recombination detection algorithms. We used the coalescent with recombination to simulate DNA sequences with different levels of recombination, genetic diversity, and rate variation among sites. Recombination detection methods were applied to these data sets, and whether or not each method detected recombination was recorded. Different recombination methods showed distinct performance depending on the amount of recombination, genetic diversity, and rate variation among sites. The model of nucleotide substitution under which the data were generated did not seem to have a significant effect. Most methods gain power as sequence divergence increases. In general, recombination detection methods seem to capture the presence of recombination, but they are not very powerful. Methods that use substitution patterns or incompatibility among sites were more powerful than methods based on phylogenetic incongruence. Most methods do not seem to infer more false positives than expected by chance. Depending especially on the amount of diversity in the data, different methods could be used to attain maximum power while minimizing false positives. The results shown here will provide some guidance in the selection of the most appropriate method or methods for the analysis of the particular data at hand.
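The evaluation described in the abstract comes down to recording, for each simulated data set, whether a method reported recombination, and then summarizing those outcomes. A minimal sketch of that tally follows. It is not the authors' code: the function names, the use of a recombination rate `rho` as the dictionary key, and the outcome values are all hypothetical and chosen only for illustration. The sketch assumes that replicates simulated without recombination (`rho == 0`) give an estimate of the false positive rate, and that replicates simulated with recombination give an estimate of power.

```python
from typing import Dict, List, Tuple


def detection_rate(outcomes: List[bool]) -> float:
    """Fraction of simulated replicates in which recombination was reported."""
    if not outcomes:
        raise ValueError("need at least one replicate")
    return sum(outcomes) / len(outcomes)


def summarize(results: Dict[float, List[bool]]) -> Dict[float, Tuple[str, float]]:
    """Label each recombination rate's detection rate.

    Assumption: rho == 0 means the data were simulated without
    recombination, so any detection there is a false positive.
    """
    summary = {}
    for rho, outcomes in results.items():
        label = "false_positive_rate" if rho == 0 else "power"
        summary[rho] = (label, detection_rate(outcomes))
    return summary


# Hypothetical outcomes for one method (True = recombination reported).
# These values are made up; they are not results from the paper.
example = {
    0.0: [False, False, True, False],
    4.0: [True, False, True, True],
}
summary = summarize(example)
```

Running the same tally for each method and each combination of recombination rate, diversity, and rate variation would give the kind of comparison the abstract describes, under the assumptions stated above.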