RASTA processing of speech

1 January 1994

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing

Vol. 2 (4), 578-589
https://doi.org/10.1109/89.326616

Abstract

Performance of even the best current stochastic recognizers severely degrades in an unexpected communications environment. In some cases, the environmental effect can be modeled by a set of simple transformations and, in particular, by convolution with an environmental impulse response and the addition of some environmental noise. Often, the temporal properties of these environmental effects are quite different from the temporal properties of speech. We have been experimenting with filtering approaches that attempt to exploit these differences to produce robust representations for speech recognition and enhancement and have called this class of representations relative spectra (RASTA). In this paper, we review the theoretical and experimental foundations of the method, discuss the relationship with human auditory perception, and extend the original method to combinations of additive noise and convolutional noise. We discuss the relationship between RASTA features and the nature of the recognition models that are required and the relationship of these features to delta features and to cepstral mean subtraction. Finally, we show an application of the RASTA technique to speech enhancement.
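
The filtering idea in the abstract can be illustrated with a small sketch. Convolution with a fixed channel adds a roughly constant offset to each log-spectral trajectory, while speech changes that trajectory at syllabic rates, so a band-pass filter with zero gain at DC can suppress the offset and keep the speech modulation. The abstract does not give filter coefficients; the values below are the commonly quoted RASTA band-pass filter and are an assumption here, as are the 100 Hz frame rate, the 4 Hz test modulation, the offset value, and all function and variable names.

```python
import math

# Assumed coefficients (not stated in the abstract): the commonly quoted
# RASTA band-pass filter
#   H(z) = 0.1 * (2 + z^-1 - z^-3 - 2 z^-4) / (1 - 0.98 z^-1)
# The numerator sums to zero, so the filter has zero gain at DC.
NUMERATOR = [0.2, 0.1, 0.0, -0.1, -0.2]
POLE = 0.98


def rasta_filter(trajectory, numerator=NUMERATOR, pole=POLE):
    """Causally filter one log-spectral trajectory (one value per frame)."""
    out = []
    prev_y = 0.0
    for n in range(len(trajectory)):
        # FIR part: weighted sum of the current and four previous frames.
        fir = 0.0
        for k, b in enumerate(numerator):
            if n - k >= 0:
                fir += b * trajectory[n - k]
        # IIR part: single-pole leaky integrator.
        prev_y = fir + pole * prev_y
        out.append(prev_y)
    return out


# Hypothetical test signal: a log-energy trajectory modulated at 4 Hz,
# sampled at an assumed frame rate of 100 frames per second.
FRAME_RATE = 100.0
N_FRAMES = 600
clean = [math.sin(2.0 * math.pi * 4.0 * n / FRAME_RATE) for n in range(N_FRAMES)]

# A fixed convolutional channel appears as a constant offset in the log domain.
CHANNEL_OFFSET = 1.5
distorted = [x + CHANNEL_OFFSET for x in clean]

y_clean = rasta_filter(clean)
y_distorted = rasta_filter(distorted)

# After the start-up transient decays, the two outputs nearly coincide.
residual = abs(y_distorted[-1] - y_clean[-1])
peak = max(abs(v) for v in y_clean[-100:])
print("residual offset at last frame:", residual)
print("peak of filtered modulation:", peak)
```

Because the pole sits close to 1, the start-up transient caused by the offset decays slowly, over several hundred frames in this sketch. With SciPy available, the same recursion could be written as `scipy.signal.lfilter(NUMERATOR, [1.0, -POLE], trajectory)`.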

This publication has 17 references indexed in Scilit:

Optimization of perceptually-based ASR front-end (automatic speech recognition)
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
Integrating RASTA-PLP into speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
RASTA processing of speech
IEEE Transactions on Speech and Audio Processing, 1994
Hidden Markov models with templates as non-stationary states: an application to speech recognition
Computer Speech & Language, 1993
Comparative experiments on large vocabulary speech recognition
Published by Association for Computational Linguistics (ACL), 1993
Perceptual linear predictive (PLP) analysis of speech
The Journal of the Acoustical Society of America, 1990
Noise adaptation in a hidden Markov model speech recognition system
Computer Speech & Language, 1989
Auditory enhancement of changes in spectral amplitude
The Journal of the Acoustical Society of America, 1987
Suppression of acoustic noise in speech using spectral subtraction
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
Differential Intensity Sensitivity of the Ear for Pure Tones
Physical Review, 1928