Acoustic Markov models used in the Tangora speech recognition system
- 6 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
The Speech Recognition Group at IBM Research has developed a real-time, isolated-word speech recognizer called Tangora, which accepts natural English sentences drawn from a vocabulary of 20000 words. Despite its large vocabulary, the Tangora recognizer requires only about 20 minutes of speech from each new user for training purposes. The accuracy of the system and its ease of training are largely attributable to the use of hidden Markov models in its acoustic match component. An automatic technique for constructing Markov word models is described and results are included of experiments with speaker-dependent and speaker-independent models on several isolated-word recognition tasks Author(s) Bahl, L.R. IBM Thomas J. Watson Res. Center, Yorktown Heights, NY Brown, P.F. ; de Souza, P.V. ; Picheny, M.A.Keywords
This publication has 17 references indexed in Scilit:
- Improved hidden Markov modeling of phonemes for continuous speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Application of an adaptive auditory model to speech recognitionThe Journal of the Acoustical Society of America, 1985
- A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov models and level buildingIEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
- Vector quantization in speech codingProceedings of the IEEE, 1985
- Vector quantizationIEEE ASSP Magazine, 1984
- A Maximum Likelihood Approach to Continuous Speech RecognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1983
- Dynamic programming algorithm optimization for spoken word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1978
- Continuous speech recognition by statistical methodsProceedings of the IEEE, 1976
- Design of a linguistic statistical decoder for the recognition of continuous speechIEEE Transactions on Information Theory, 1975
- Minimum prediction residual principle applied to speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1975