Acoustic Markov models used in the Tangora speech recognition system

6 January 2003

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

pp. 497-500
https://doi.org/10.1109/icassp.1988.196628

Abstract

The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences drawn from a vocabulary of 20,000 words. Despite its large vocabulary, the Tangora recognizer requires only about 20 minutes of speech from each new user for training purposes. The accuracy of the system and its ease of training are largely attributable to the use of hidden Markov models in its acoustic match component. An automatic technique for constructing Markov word models is described, and results of experiments with speaker-dependent and speaker-independent models on several isolated-word recognition tasks are included.

Author(s): Bahl, L.R. (IBM Thomas J. Watson Res. Center, Yorktown Heights, NY); Brown, P.F.; de Souza, P.V.; Picheny, M.A.

This publication has 17 references indexed in Scilit:

Improved hidden Markov modeling of phonemes for continuous speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
Application of an adaptive auditory model to speech recognition
The Journal of the Acoustical Society of America, 1985
A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov models and level building
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
Vector quantization in speech coding
Proceedings of the IEEE, 1985
Vector quantization
IEEE ASSP Magazine, 1984
A Maximum Likelihood Approach to Continuous Speech Recognition
Published by Institute of Electrical and Electronics Engineers (IEEE), 1983
Dynamic programming algorithm optimization for spoken word recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1978
Continuous speech recognition by statistical methods
Proceedings of the IEEE, 1976
Design of a linguistic statistical decoder for the recognition of continuous speech
IEEE Transactions on Information Theory, 1975
Minimum prediction residual principle applied to speech recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1975