Selecting Models of Nucleotide Substitution: An Application to Human Immunodeficiency Virus 1 (HIV-1)
Open Access
- 1 June 2001
- journal article
- research article
- Published by Oxford University Press (OUP) in Molecular Biology and Evolution
- Vol. 18 (6) , 897-906
- https://doi.org/10.1093/oxfordjournals.molbev.a003890
Abstract
The blind use of models of nucleotide substitution in evolutionary analyses is a common practice in the viral community. Typically, a simple model of evolution like the Kimura two-parameter model is used for estimating genetic distances and phylogenies, either because other authors have used it or because it is the default in various phylogenetic packages. Using two statistical approaches to model fitting, hierarchical likelihood ratio tests and the Akaike information criterion, we show that different viral data sets are better explained by different models of evolution. We demonstrate our results with the analysis of HIV-1 sequences from a hierarchy of samples; sequences within individuals, individuals within subtypes, and subtypes within groups. We also examine results for three different gene regions: gag, pol, and env. The Kimura two-parameter model was not selected as the best-fit model for any of these data sets, despite its widespread use in phylogenetic analyses of HIV-1 sequences. Furthermore, the model complexity increased with increasing sequence divergence. Finally, the molecular-clock hypothesis was rejected in most of the data sets analyzed, throwing into question clock-based estimates of divergence times for HIV-1. The importance of models in evolutionary analyses and their repercussions on the derived conclusions are discussed.Keywords
This publication has 58 references indexed in Scilit:
- Different Models, Different Trees: The Geographic Origin of PTLV-IMolecular Phylogenetics and Evolution, 1999
- Phylogenetic analysis of the env gene of HIV-1 isolates taking into account individual nucleotide substitution ratesAIDS, 1996
- On the maximum likelihood method in molecular phylogeneticsJournal of Molecular Evolution, 1991
- Mutation pattern of human immunodeficiency virus genesJournal of Molecular Evolution, 1991
- Robustness of maximum likelihood tree estimation against different patterns of base substitutionsJournal of Molecular Evolution, 1991
- The general stochastic model of nucleotide substitutionJournal of Theoretical Biology, 1990
- Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard ConditionsJournal of the American Statistical Association, 1987
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980
- A new look at the statistical model identificationIEEE Transactions on Automatic Control, 1974