Recent advances in speech processing
- 13 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
An overview is given of recent advances in the domain of speech recognition. The author focuses on speech recognition, but also mentions some progress in other areas of speech processing (speaker recognition, speech synthesis, speech analysis and coding) using similar methodologies. The problems related to automatic speech processing are identified, and the initial approaches that have been followed in order to address those problems are described. The author then introduces the methodological novelties that allowed for progress along three axes: from isolated-word recognition to continuous speech, from speaker-dependent recognition to speaker-independent, and from small vocabularies to large vocabularies. Special emphasis centers on the improvements made possible by Markov models and, more recently, by connectionist models, resulting in improved performance for difficult vocabularies or in more robust systems. Some specialized hardware is described, as are efforts aimed at assessing speech-recognition systems.Keywords
This publication has 94 references indexed in Scilit:
- Hamlet: a prototype of a voice-activated typewriterIEE Proceedings I (Communications, Speech and Vision), 1989
- Network-based connected digit recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
- On Turing's formula for word probabilitiesIEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
- Network-based isolated digit recognition using vector quantizationIEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
- A speaker-independent, syntax-directed, connected word recognition system based on hidden Markov models and level buildingIEEE Transactions on Acoustics, Speech, and Signal Processing, 1985
- The use of a one-stage dynamic programming algorithm for connected word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1984
- Optimization by Simulated AnnealingScience, 1983
- A simplified, robust training procedure for speaker trained, isolated word recognition systemsThe Journal of the Acoustical Society of America, 1980
- Two-level DP-matching--A dynamic programming-based pattern matching algorithm for connected word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- The DRAGON system--An overviewIEEE Transactions on Acoustics, Speech, and Signal Processing, 1975