A speech recognizer using radial basis function neural networks in an HMM framework
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, pp. 629-632
- https://doi.org/10.1109/icassp.1992.225830
Abstract
A high-performance speaker-independent isolated-word speech recognizer was developed that combines hidden Markov models (HMMs) and radial basis function (RBF) neural networks. RBF networks in this recognizer use discriminant training techniques to estimate Bayesian probabilities for each speech frame, while HMM decoders estimate overall word likelihood scores from the network outputs. RBF training is performed after the HMM recognizer has automatically segmented training tokens using forced Viterbi alignment. In recognition experiments using a speaker-independent E-set database, the hybrid recognizer had an error rate of 11.5%, compared to 15.7% for the robust unimodal Gaussian HMM recognizer upon which the hybrid system was based. The error rate was also lower than that of a tied-mixture HMM recognizer with the same number of centers. These results demonstrate that RBF networks can be successfully incorporated in hybrid recognizers and suggest that they may be capable of good performance with fewer parameters than required by Gaussian mixture classifiers.
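The hybrid scheme summarized in the abstract can be illustrated with a minimal sketch: an RBF layer turns each speech frame into per-state probability estimates, and a Viterbi pass over a word HMM combines them into a word score. Everything below is an assumption made for illustration and is not taken from the paper: the function names `rbf_posteriors` and `viterbi_log_score`, the Gaussian basis functions with a linear output layer, the use of log posteriors directly as observation scores, and the toy two-state left-to-right model.

```python
import numpy as np

def rbf_posteriors(frames, centers, widths, weights):
    """Per-frame state probabilities from a Gaussian RBF layer (illustrative).

    frames: (T, D), centers: (K, D), widths: (K,), weights: (K, S).
    """
    # Squared Euclidean distance from every frame to every center: (T, K).
    d2 = ((frames[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    hidden = np.exp(-d2 / (2.0 * widths ** 2))
    # Linear output layer; clip and renormalize so each row is a distribution
    # (an assumption of this sketch, not a detail from the paper).
    out = np.clip(hidden @ weights, 1e-10, None)
    return out / out.sum(axis=1, keepdims=True)

def viterbi_log_score(log_obs, log_trans, log_init):
    """Best-path log score and state sequence for one word model.

    log_obs: (T, S), log_trans: (S, S) with [i, j] = from i to j, log_init: (S,).
    """
    T, S = log_obs.shape
    delta = log_init + log_obs[0]
    back = np.zeros((T, S), dtype=int)
    for t in range(1, T):
        cand = delta[:, None] + log_trans      # cand[i, j]: arrive in j from i
        back[t] = cand.argmax(axis=0)
        delta = cand.max(axis=0) + log_obs[t]
    path = [int(delta.argmax())]
    for t in range(T - 1, 0, -1):              # trace back the best path
        path.append(int(back[t, path[-1]]))
    return float(delta.max()), path[::-1]

# Toy two-state left-to-right word model with one RBF center per state.
frames = np.array([[0.0, 0.0], [0.1, 0.0], [1.0, 1.0], [0.9, 1.0]])
centers = np.array([[0.0, 0.0], [1.0, 1.0]])
widths = np.array([0.5, 0.5])
weights = np.eye(2)

post = rbf_posteriors(frames, centers, widths, weights)
with np.errstate(divide="ignore"):             # log(0) = -inf marks forbidden moves
    log_trans = np.log(np.array([[0.5, 0.5], [0.0, 1.0]]))
    log_init = np.log(np.array([1.0, 0.0]))
score, path = viterbi_log_score(np.log(post), log_trans, log_init)
print(score, path)
```

In a full recognizer, one such score would be computed per vocabulary word and the highest-scoring word selected. The state sequence returned by the Viterbi pass is also the kind of frame-to-state segmentation that forced alignment provides as training targets for the network.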
This publication has 4 references indexed in Scilit:
- Multi-style training for robust isolated-word speech recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- Connectionist Viterbi training: a new hybrid method for continuous speech recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- The Lincoln tied-mixture HMM continuous speech recognizer. Published by Association for Computational Linguistics (ACL), 1990
- Pattern classification using neural networks. IEEE Communications Magazine, 1989