Pseudo-Likelihood Analysis of Codon Substitution Models with Neighbor-Dependent Rates
- 1 November 2005
- journal article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 12 (9) , 1166-1182
- https://doi.org/10.1089/cmb.2005.12.1166
Abstract
Recently, Markov processes for the evolution of coding DNA with neighbor dependence in the instantaneous substitution rates have been considered. The neighbor dependency makes the models analytically intractable, and previously Markov chain Monte Carlo methods have been used for statistical inference. Using a pseudo-likelihood idea, we introduce in this paper an approximative estimation method which is fast to compute. The pseudo-likelihood estimates are shown to be very accurate, and from analyzing 348 human-mouse coding sequences we conclude that the incorporation of a CpG effect improves the fit of the model considerably.Keywords
This publication has 11 references indexed in Scilit:
- A nucleotide substitution model with nearest-neighbour interactionsBioinformatics, 2004
- Efficient approximations for learning phylogenetic HMM models from dataBioinformatics, 2004
- Bayesian Markov chain Monte Carlo sequence analysis reveals varying neutral substitution patterns in mammalian evolutionProceedings of the National Academy of Sciences, 2004
- Protein Evolution with Dependence Among Codons Due to Tertiary StructureMolecular Biology and Evolution, 2003
- DNA Sequence Evolution with Neighbor-Dependent MutationJournal of Computational Biology, 2003
- Codon and Rate Variation Models in Molecular PhylogenyMolecular Biology and Evolution, 2002
- An expectation maximization algorithm for training hidden substitution models 1 1Edited by F. CohenJournal of Molecular Biology, 2002
- A Dependent-Rates Model and an MCMC-Based Methodology for the Maximum-Likelihood Analysis of Sequences with Overlapping Reading FramesMolecular Biology and Evolution, 2001
- Probabilistic models of DNA sequence evolution with context dependent rates of substitutionAdvances in Applied Probability, 2000
- Statistical Analysis of Non-Lattice DataJournal of the Royal Statistical Society: Series D (The Statistician), 1975