Speech recognition using segmental neural nets

1 January 1992

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (ISSN 1520-6149), pp. 625-628
https://doi.org/10.1109/icassp.1992.225831

Abstract

The authors present the concept of a segmental neural net (SNN) for phonetic modeling in continuous speech recognition (CSR) and demonstrate how this can be used with a multiple hypothesis (or N-Best) paradigm to combine different CSR systems. In particular, the authors developed a system that combines the SNN with a hidden Markov model (HMM) system. In a speaker-independent, 1000-word CSR test using a word-pair grammar, the error rate for the hybrid system dropped 25% from that of a state-of-the-art HMM system alone. By taking into account all the frames of a phonetic segment simultaneously, the SNN overcomes the well-known conditional-independence limitation of HMMs. The hybrid SNN/HMM system generates likely phonetic segmentations from the HMM N-best list, which are scored by the SNN. The HMM and SNN scores are then combined to optimize performance.
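The N-best combination described in the abstract can be illustrated with a minimal sketch. It assumes each hypothesis carries one log-score from the HMM and one from the SNN, and that the two are combined by a weighted sum; the abstract does not specify the combination rule, so the weighted sum, the names `Hypothesis` and `rescore`, the weights, and all score values below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Hypothesis:
    words: str
    hmm_score: float  # log-score from the HMM system (made-up value)
    snn_score: float  # log-score from the SNN for the same hypothesis (made-up value)


def rescore(nbest, w_hmm=1.0, w_snn=1.0):
    """Reorder an N-best list by a weighted sum of HMM and SNN log-scores.

    The weighted sum is an assumed combination rule, not necessarily the
    one used in the paper. The best-scoring hypothesis comes first.
    """
    return sorted(
        nbest,
        key=lambda h: w_hmm * h.hmm_score + w_snn * h.snn_score,
        reverse=True,
    )


# Hypothetical 3-best list: the HMM alone prefers the second entry.
nbest = [
    Hypothesis("show all ships", -120.0, -45.0),
    Hypothesis("show tall ships", -119.0, -52.0),
    Hypothesis("show all chips", -123.0, -44.0),
]

best = rescore(nbest)[0]
print(best.words)
```

In this toy list the HMM score alone ranks "show tall ships" first, while the combined score ranks "show all ships" first. In practice the weights would be tuned on held-out data rather than fixed at 1.0.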
