Speech recognition using segmental neural nets
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1 (15206149) , 625-628 vol.1
- https://doi.org/10.1109/icassp.1992.225831
Abstract
The authors present the concept of a segmental neural net (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how this can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, the authors developed a system that combines the SNN with a hidden Markov model (HMM) system. In a speaker-independent, 1000-word CSR test using a word-pair grammar, the error rate for the hybrid system dropped 25% from that of a state-of-the-art HMM system alone. By taking into account all the frames of a phonetic segment simultaneously, the SNN overcomes the well-known conditional-independence limitation of HMMs. The hybrid SNN/HMM system generates likely phonetic segmentations from the HMM N-best list, which are scored by the SNN. The HMM and SNN scores are then combined to optimize performance.<>Keywords
This publication has 7 references indexed in Scilit:
- Some statistical issues in the comparison of speech recognition algorithmsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- The N-best algorithms: an efficient and exact procedure for finding the N most likely sentence hypothesesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Integration of diverse recognition methodologies through reevaluation of N-best sentence hypothesesPublished by Association for Computational Linguistics (ACL) ,1991
- A comparison of several approximate algorithms for finding multiple (N-best) sentence hypothesesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- A new error criterion for posterior probability estimation with neural netsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1990
- Toward a real-time spoken language system using commercial hardwarePublished by Association for Computational Linguistics (ACL) ,1990
- A stochastic segment model for phoneme-based continuous speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1989