A Learning Method of Hidden Markov Models for Sequence Discrimination
- 1 January 1996
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 3 (3) , 361-373
- https://doi.org/10.1089/cmb.1996.3.361
Abstract
We propose a learning method for hidden Markov models (HMM) for sequence discrimination. When given an HMM, our method sets a function that corresponds to the product of a difference between the observed and the desired likelihoods for each training sequence, and using a gradient descent algorithm, trains the HMM parameters so that the function should be minimized. This method allows us to use not only the examples belonging to a class that should be represented by the HMM, but also the examples not belonging to the class, i.e., negative examples. We evaluated our method in a series of experiments based on a type of cross-validation, and compared the results with those of two existing methods. Experimental results show that our method greatly reduces the discrimination errors made by the other two methods. We conclude that both the use of negative examples and our method of using negative examples are useful for training HMMs in discriminating unknown sequences. Key words: hidden Markov models, sequence discrimination, lipocalin family, gradient-descent, stochastic models.Keywords
This publication has 20 references indexed in Scilit:
- The PROSITE database, its status in 1995Nucleic Acids Research, 1996
- RNA sequence analysis using covariance modelsNucleic Acids Research, 1994
- Smooth On-Line Learning Algorithms for Hidden Markov ModelsNeural Computation, 1994
- Hidden Markov models of biological primary sequence information.Proceedings of the National Academy of Sciences, 1994
- Hidden Markov Models of the G-Protein-Coupled Receptor FamilyJournal of Computational Biology, 1994
- The PROSITE dictionary of sites and patterns in proteins, its current statusNucleic Acids Research, 1993
- The SWISS-PROT protein sequence data bankNucleic Acids Research, 1991
- Alpha-nets: A recurrent ‘neural’ network architecture with a hidden Markov model interpretationSpeech Communication, 1990
- Stochastic models for heterogeneous DNA sequencesBulletin of Mathematical Biology, 1989
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977