Hybrid Modeling, HMM/NN Architectures, and Protein Applications
- 1 October 1996
- journal article
- Published by MIT Press in Neural Computation
- Vol. 8 (7) , 1541-1565
- https://doi.org/10.1162/neco.1996.8.7.1541
Abstract
We describe a hybrid modeling approach where the parameters of a model are calculated and modulated by another model, typically a neural network (NN), to avoid both overfitting and underfitting. We develop the approach for the case of Hidden Markov Models (HMMs), by deriving a class of hybrid HMM/NN architectures. These architectures can be trained with unified algorithms that blend HMM dynamic programming with NN backpropagation. In the case of complex data, mixtures of HMMs or modulated HMMs must be used. NNs can then be applied both to the parameters of each single HMM, and to the switching or modulation of the models, as a function of input or context. Hybrid HMM/NN architectures provide a flexible NN parameterization for the control of model structure and complexity. At the same time, they can capture distributions that, in practice, are inaccessible to single HMMs. The HMM/NN hybrid approach is tested, in its simplest form, by constructing a model of the immunoglobulin protein family. A hybrid model is trained, and a multiple alignment derived, with less than a fourth of the number of parameters used with previous single HMMs.Keywords
This publication has 9 references indexed in Scilit:
- Amino acid substitution matrices from an information theoretic perspectivePublished by Elsevier ,2005
- The Helmholtz MachineNeural Computation, 1995
- An HMM/MLP Architecture for Sequence RecognitionNeural Computation, 1995
- Hidden Markov models of biological primary sequence information.Proceedings of the National Academy of Sciences, 1994
- Improving the sensitivity of the sequence profile methodProtein Science, 1994
- Hidden Markov Models of the G-Protein-Coupled Receptor FamilyJournal of Computational Biology, 1994
- Connectionist learning of belief networksArtificial Intelligence, 1992
- Adaptive Mixtures of Local ExpertsNeural Computation, 1991
- A tutorial on hidden Markov models and selected applications in speech recognitionProceedings of the IEEE, 1989