Maximum-likelihood models for combined analyses of multiple sequence data
- 1 May 1996
- journal article
- research article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 42 (5) , 587-596
- https://doi.org/10.1007/bf02352289
Abstract
Models of nucleotide substitution were constructed for combined analyses of heterogeneous sequence data (such as those of multiple genes) from the same set of species. The models account for different aspects of the heterogeneity in the evolutionary process of different genes, such as differences in nucleotide frequencies, in substitution rate bias (for example, the transition/transversion rate bias), and in the extent of rate variation across sites. Model parameters were estimated by maximum likelihood and the likelihood ratio test was used to test hypotheses concerning sequence evolution, such as rate constancy among lineages (the assumption of a molecular clock) and proportionality of branch lengths for different genes. The example data from a segment of the mitochondrial genome of six hominoid species (human, common and pygmy chimpanzees, gorilla, orangutan, and siamang) were analyzed. Nucleotides at the three codon positions in the protein-coding regions and from the tRNA-coding regions were considered heterogeneous data sets. Statistical tests showed that the amount of evolution in the sequence data reflected in the estimated branch lengths can be explained by the codon-position effect and lineage effect of substitution rates. The assumption of a molecular clock could not be rejected when the data were analyzed separately or when the rate variation among sites was ignored. However, significant differences in substitution rate among lineages were found when the data sets were combined and when the rate variation among sites was accounted for in the models. Under the assumption that the orangutan and African apes diverged 13 million years ago, the combined analysis of the sequence data estimated the times for the human-chimpanzee separation and for the separation of the gorilla as 4.3 and 6.8 million years ago, respectively.
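The likelihood ratio test of the molecular clock mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' code. It assumes the standard single-data-set setup, in which the clock model has n - 1 node-age parameters, the unconstrained model has 2n - 3 branch lengths, and the test statistic is compared with a chi-square distribution with n - 2 degrees of freedom; the degrees of freedom in the paper's combined-data models may differ. The function names and the log-likelihood values are hypothetical and chosen only for illustration.

```python
import math


def chi2_sf_even_df(x, df):
    """Upper-tail probability P(X > x) for a chi-square variable.

    Uses the closed form that exists only for even degrees of freedom:
    exp(-x/2) * sum_{i=0}^{df/2 - 1} (x/2)^i / i!
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("this closed form needs a positive, even df")
    if x <= 0:
        return 1.0
    half = x / 2.0
    term = 1.0   # (x/2)^0 / 0!
    total = 1.0
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


def clock_lrt(lnl_clock, lnl_free, n_species):
    """Likelihood ratio test of the clock (null) against no clock.

    Assumes one data set: df = (2n - 3) - (n - 1) = n - 2.
    """
    df = n_species - 2
    stat = 2.0 * (lnl_free - lnl_clock)
    return stat, df, chi2_sf_even_df(stat, df)


# Hypothetical log-likelihoods for six species (not values from the paper).
stat, df, p_value = clock_lrt(lnl_clock=-2000.0, lnl_free=-1995.0, n_species=6)
print(f"2*delta lnL = {stat:.2f}, df = {df}, p = {p_value:.4f}")
```

With six species the test has 4 degrees of freedom, so the even-df closed form applies and no statistics library is needed; for an odd number of degrees of freedom a general chi-square routine would be required instead.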