Adaptive source generator compensation and enhancement for speech recognition in noisy stressful environments
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (15206149) , 95-98 vol.2
- https://doi.org/10.1109/icassp.1993.319239
Abstract
The author describes a low-vocabulary speech recognition algorithm which provides robust performance in noisy environments with particular emphasis on characteristics due to stress. A stressed speech source generator framework is formulated to achieve robust speech parameter characterization using a morphological constrained enhancement algorithm and stressed source compensation which is unique for each source generator across a stressed speaking class. An estimated source generator class sequence allows noise parameter enhancement and stress compensation schemes to adapt to changing speech generator types. A phonetic consistency rule is also employed based on input source generator partitioning. Average recognition rates for noisy stressful speech are shown to increase from an average 36.7% for a baseline recognizer to 74.7% for the new recognition algorithm. The new algorithm is also more consistent under varying noisy conditions as demonstrated by a decrease in standard deviation of recognition from 21.1 to 11.9, and a reduction in confusable word-pairs under noisy, stressed speaking conditions.Keywords
This publication has 11 references indexed in Scilit:
- A speaker-stress resistant HMM isolated word recognizerPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Multi-style training for robust isolated-word speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Stress compensation and noise reduction algorithms for robust speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Acoustic-phonetic analysis of loud and Lombard speech in simulated cockpit conditionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Morphological constrained feature enhancement with adaptive cepstral compensation (MCE-ACC) for speech recognition in noise and Lombard effectIEEE Transactions on Speech and Audio Processing, 1994
- Speech recognition in adverse environmentsComputer Speech & Language, 1991
- Constrained iterative speech enhancement with application to speech recognitionIEEE Transactions on Signal Processing, 1991
- Morphological filters--Part I: Their set-theoretic analysis and relations to linear shift-invariant filtersIEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
- Suppression of acoustic noise in speech using spectral subtractionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Effect of Noise, System Gain, and Assigned Task on Talking Levels in Loudspeaker CommunicationThe Journal of the Acoustical Society of America, 1966