Combining Phylogenetic and Hidden Markov Models in Biosequence Analysis
- 1 March 2004
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 11 (2-3), 413-428
- https://doi.org/10.1089/1066527041410472
Abstract
A few models have appeared in recent years that consider not only the way substitutions occur through evolutionary history at each site of a genome, but also the way the process changes from one site to the next. These models combine phylogenetic models of molecular evolution, which apply to individual sites, and hidden Markov models, which allow for changes from site to site. Besides improving the realism of ordinary phylogenetic models, they are potentially very powerful tools for inference and prediction, for example for gene finding or prediction of secondary structure. In this paper, we review progress on combined phylogenetic and hidden Markov models and present some extensions to previous work. Our main result is a simple and efficient method for accommodating higher-order states in the HMM, which allows for context-dependent models of substitution, that is, models that consider the effects of neighboring bases on the pattern of substitution. We present experimental results indicating that higher-order states, autocorrelated rates, and multiple functional categories all lead to significant improvements in the fit of a combined phylogenetic and hidden Markov model, with the effect of higher-order states being particularly pronounced.
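The combination described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: it assumes a Jukes-Cantor substitution model, a toy three-species tree, and an HMM whose hidden states are two rate categories ("slow" and "fast"). Each state's emission probability for an alignment column is the phylogenetic likelihood of that column, computed with Felsenstein's pruning algorithm, and the forward algorithm sums over state paths along the alignment. All names (`TREE`, `column_likelihood`, `forward_likelihood`), branch lengths, rates, and transition probabilities are hypothetical, and the higher-order (context-dependent) states that are the paper's main result are not covered.

```python
import math

BASES = "ACGT"

# Hypothetical toy tree: a node is a leaf name (str) or a list of
# (child, branch_length) pairs. Branch lengths are illustrative only.
TREE = [("human", 0.1), ([("mouse", 0.3), ("rat", 0.3)], 0.2)]


def jc_prob(a, b, t):
    """Jukes-Cantor probability that base a becomes base b over branch length t."""
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if a == b else 0.25 - 0.25 * e


def partial(node, column, rate):
    """Felsenstein pruning: P(leaves below node | base at node) for each base."""
    if isinstance(node, str):  # leaf: indicator vector for the observed base
        return [1.0 if column[node] == x else 0.0 for x in BASES]
    result = [1.0] * len(BASES)
    for child, length in node:
        below = partial(child, column, rate)
        for i, a in enumerate(BASES):
            # Sum over the child's base; the state's rate scales the branch.
            result[i] *= sum(
                jc_prob(a, b, rate * length) * below[j]
                for j, b in enumerate(BASES)
            )
    return result


def column_likelihood(tree, column, rate):
    """Emission probability of one alignment column (uniform root distribution)."""
    return sum(0.25 * p for p in partial(tree, column, rate))


def forward_likelihood(tree, columns, rates, start, trans):
    """Forward algorithm: total alignment likelihood, summed over state paths."""
    n = len(rates)
    f = [start[k] * column_likelihood(tree, columns[0], rates[k]) for k in range(n)]
    for col in columns[1:]:
        emit = [column_likelihood(tree, col, r) for r in rates]
        f = [emit[k] * sum(f[j] * trans[j][k] for j in range(n)) for k in range(n)]
    return sum(f)


# Usage: a short made-up alignment, split into per-site columns.
alignment = {"human": "ACGTA", "mouse": "ACGCA", "rat": "ACTCA"}
columns = [
    {name: seq[i] for name, seq in alignment.items()}
    for i in range(len(alignment["human"]))
]
rates = [0.3, 2.0]                    # slow and fast rate categories (assumed)
start = [0.5, 0.5]
trans = [[0.9, 0.1], [0.1, 0.9]]      # "sticky" states give autocorrelated rates
print(forward_likelihood(TREE, columns, rates, start, trans))
```

With large self-transition probabilities, neighboring sites tend to share a rate category, which is one simple way to obtain the autocorrelated rates the abstract mentions. For long alignments the forward values should be rescaled or kept in log space to avoid underflow.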