RASTA processing of speech
- 1 January 1994
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing
- Vol. 2 (4) , 578-589
- https://doi.org/10.1109/89.326616
Abstract
Performance of even the best current stochastic recognizers severely degrades in an unexpected communications environment. In some cases, the environmental effect can be modeled by a set of simple transformations and, in particular, by convolution with an environmental impulse response and the addition of some environmental noise. Often, the temporal properties of these environmental effects are quite different from the temporal properties of speech. We have been experimenting with filtering approaches that attempt to exploit these differences to produce robust representations for speech recognition and enhancement and have called this class of representations relative spectra (RASTA). In this paper, we review the theoretical and experimental foundations of the method, discuss the relationship with human auditory perception, and extend the original method to combinations of additive noise and convolutional noise. We discuss the relationship between RASTA features and the nature of the recognition models that are required and the relationship of these features to delta features and to cepstral mean subtraction. Finally, we show an application of the RASTA technique to speech enhancementKeywords
This publication has 17 references indexed in Scilit:
- Optimization of perceptually-based ASR front-end (automatic speech recognition)Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Integrating RASTA-PLP into speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- RASTA processing of speechIEEE Transactions on Speech and Audio Processing, 1994
- Hidden Markov models with templates as non-stationary states: an application to speech recognitionComputer Speech & Language, 1993
- Comparative experiments on large vocabulary speech recognitionPublished by Association for Computational Linguistics (ACL) ,1993
- Perceptual linear predictive (PLP) analysis of speechThe Journal of the Acoustical Society of America, 1990
- Noise adaptation in a hidden Markov model speech recognition systemComputer Speech & Language, 1989
- Auditory enhancement of changes in spectral amplitudeThe Journal of the Acoustical Society of America, 1987
- Suppression of acoustic noise in speech using spectral subtractionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Differential Intensity Sensitivity of the Ear for Pure TonesPhysical Review B, 1928