Recent advances in speech processing

Abstract

An overview is given of recent advances in the domain of speech recognition. The author focuses on speech recognition, but also mentions some progress in other areas of speech processing (speaker recognition, speech synthesis, speech analysis and coding) using similar methodologies. The problems related to automatic speech processing are identified, and the initial approaches that have been followed in order to address those problems are described. The author then introduces the methodological novelties that allowed for progress along three axes: from isolated-word recognition to continuous speech, from speaker-dependent recognition to speaker-independent, and from small vocabularies to large vocabularies. Special emphasis centers on the improvements made possible by Markov models and, more recently, by connectionist models, resulting in improved performance for difficult vocabularies or in more robust systems. Some specialized hardware is described, as are efforts aimed at assessing speech-recognition systems.

Keywords

This publication has 94 references indexed in Scilit:

Hamlet: a prototype of a voice-activated typewriter
IEE Proceedings I (Communications, Speech and Vision), 1989
Network-based connected digit recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
On Turing's formula for word probabilities
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
Network-based isolated digit recognition using vector quantization
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov models and level building
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
The use of a one-stage dynamic programming algorithm for connected word recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1984
Optimization by Simulated Annealing
Science, 1983
A simplified, robust training procedure for speaker trained, isolated word recognition systems
The Journal of the Acoustical Society of America, 1980
Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
The DRAGON system--An overview
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1975