Complete Families of Linear Invariants for Some Stochastic Models of Sequence Evolution, with and without the Molecular Clock Assumption
- 1 January 1996
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 3 (1) , 19-31
- https://doi.org/10.1089/cmb.1996.3.19
Abstract
For various models of sequence evolution, the set of linear functions of the frequencies of the nucleotide patterns forms a vector space, the invariant space. Here we distinguish between the model of nucleotide substitution, and the phylogenetic tree T describing the paths on which these changes occur. We describe a procedure to construct a basis of the invariant space for those models that are extensions of models incorporating Kimura's three substitution model of nucleotide change, including both the Jukes–Cantor and Cavender– Farris models. The dimension of the invariant space is determined, for those models where it is independent of the tree topology, as a function of the number of sequences. These are calculated where the nucleotide distribution at the root is unspecified, and both with, and without, the assumption of the molecular clock hypothesis. The invariants have a number of potential applications, including tree identification, and testing the fit of models (which could include the molecular clock) to sequence data.Keywords
This publication has 7 references indexed in Scilit:
- Linear invariants under Jukes’ and Cantor’s one-parameter modelJournal of Theoretical Biology, 1995
- Classifying and Counting Linear Phylogenetic Invariants for the Jukes–Cantor ModelJournal of Computational Biology, 1995
- A discrete Fourier analysis for evolutionary trees.Proceedings of the National Academy of Sciences, 1994
- A complete family of phylogenetic invariants for any number of taxa under Kimura's 3ST modelNew Zealand Journal of Botany, 1993
- Counting phylogenetic invariants in some simple casesJournal of Theoretical Biology, 1991
- Estimation of evolutionary distances between homologous nucleotide sequences.Proceedings of the National Academy of Sciences, 1981
- A simple method for estimating evolutionary rates of base substitutions through comparative studies of nucleotide sequencesJournal of Molecular Evolution, 1980