Accuracies of ancestral amino acid sequences inferred by the parsimony, likelihood, and distance methods
- 1 January 1997
- journal article
- research article
- Published by Springer Nature in Journal of Molecular Evolution
- Vol. 44 (S1) , S139-S146
- https://doi.org/10.1007/pl00000067
Abstract
Information about protein sequences of ancestral organisms is important for identifying critical amino acid substitutions that have caused the functional change of proteins in evolution. Using computer simulation, we studied the accuracy of ancestral amino acids inferred by two currently available methods (maximum-parsimony [MP] and maximum-likelihood [ML] methods) in addition to a distance method, which was newly developed in this paper. All three methods give reliable inference when the divergence of amino acid sequences is low. When the extent of sequence divergence is high, however, the ML and distance methods give more accurate results than the MP method, particularly when the phylogenetic tree includes long branches. The accuracy of inferred ancestral amino acids does not change very much when a few present-day sequences are added or eliminated. When an incorrect model of amino acid substitution is used for the ML and distance methods, the accuracy decreases, but it is still higher than that for the MP method. When the tree topology used is partially incorrect, the accuracy in the correct part of the tree is virtually unaffected. The posterior probability of inferred ancestral amino acids computed by the ML and distance methods is an unbiased estimate of the true probability when a correct substitution model is used but may become an overestimate when a simpler model is used.Keywords
This publication has 23 references indexed in Scilit:
- Angiotensin II-Forming Activity in a Reconstructed Ancestral ChymaseScience, 1996
- Uncertainty in ancient phylogeniesNature, 1995
- Reconstructing the evolutionary history of the artiodactyl ribonuclease superfamilyNature, 1995
- Tests of applicability of several substitution models for DNA sequence data.Molecular Biology and Evolution, 1995
- Phylogenetic relationships among eutherian orders estimated from inferred sequences of mitochondrial proteins: Instability of a tree based on a single geneJournal of Molecular Evolution, 1994
- Estimating the pattern of nucleotide substitutionJournal of Molecular Evolution, 1994
- The rapid generation of mutation data matrices from protein sequencesBioinformatics, 1992
- Evolutionary trees from DNA sequences: A maximum likelihood approachJournal of Molecular Evolution, 1981
- Minimum Mutation Fits to a Given TreePublished by JSTOR ,1973
- Toward Defining the Course of Evolution: Minimum Change for a Specific Tree TopologySystematic Zoology, 1971