Reconstructing Trees When Sequence Sites Evolve at Variable Rates
- 1 January 1994
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 1 (2), 153-163
- https://doi.org/10.1089/cmb.1994.1.153
Abstract
For a sequence of colors independently evolving on a tree under a simple Markov model, we consider conditions under which the tree can be uniquely recovered from the "sequence spectrum", that is, the expected frequencies of the various leaf colorations. This is relevant for phylogenetic analysis (where colors represent nucleotides or amino acids, and leaves represent extant taxa), as the sequence spectrum is estimated directly from a collection of aligned sequences. Allowing the rate of the evolutionary process to vary across sites is an important extension over most previous studies. We show that, given suitable restrictions on the rate distribution, the true tree (up to the placement of its root) is uniquely identified by its sequence spectrum. However, if the rate distribution is unknown and arbitrary, then, for simple models, it is possible for every tree to produce the same sequence spectrum. Hence there is a logical barrier to accurate, consistent phylogenetic inference for these models when assumptions about the rate distribution are not made. This result exploits a novel theorem on the action of polynomials with non-negative coefficients on sequences.
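To make the notion of a sequence spectrum concrete, here is a minimal sketch that computes the expected frequency of every leaf coloration on a four-leaf tree AB|CD. Everything specific in it is an assumption chosen for illustration and is not taken from the article: the two-color symmetric substitution model, the uniform distribution at the internal nodes, the discrete list of (rate, weight) pairs standing in for a rate distribution, and the names `change_prob` and `quartet_spectrum`.

```python
import itertools
import math


def change_prob(length, rate):
    """Probability that the two ends of an edge differ (two-color symmetric model)."""
    return 0.5 * (1.0 - math.exp(-2.0 * rate * length))


def edge_term(p, x, y):
    """Probability of seeing colors x and y at the two ends of an edge."""
    return p if x != y else 1.0 - p


def quartet_spectrum(lengths, rates):
    """Expected frequency of each coloration of leaves (A, B, C, D) on tree AB|CD.

    lengths: edge lengths (a, b, c, d, e), where e is the internal edge.
    rates:   list of (rate, weight) pairs; weights are assumed to sum to 1.
    """
    spectrum = {}
    for pattern in itertools.product((0, 1), repeat=4):
        total = 0.0
        for rate, weight in rates:
            pa, pb, pc, pd, pe = (change_prob(t, rate) for t in lengths)
            site = 0.0
            # Sum over the colors of the two internal nodes u (next to A, B)
            # and v (next to C, D); u is drawn uniformly from the two colors.
            for u, v in itertools.product((0, 1), repeat=2):
                site += (
                    0.5
                    * edge_term(pa, u, pattern[0])
                    * edge_term(pb, u, pattern[1])
                    * edge_term(pe, u, v)
                    * edge_term(pc, v, pattern[2])
                    * edge_term(pd, v, pattern[3])
                )
            total += weight * site  # mix over the rate categories
        spectrum[pattern] = total
    return spectrum


if __name__ == "__main__":
    # Hypothetical edge lengths and a two-category rate distribution.
    spec = quartet_spectrum((0.1, 0.2, 0.15, 0.25, 0.05), [(0.5, 0.5), (1.5, 0.5)])
    for pattern, freq in sorted(spec.items()):
        print(pattern, freq)
```

In this sketch the spectrum is a mixture over rate categories, so different combinations of tree, edge lengths, and rate weights can in principle be compared by their resulting spectra. The sketch does not reproduce the article's identifiability or non-identifiability results.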