Constrained iterative speech enhancement with application to speech recognition
- 1 April 1991
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Signal Processing
- Vol. 39 (4) , 795-805
- https://doi.org/10.1109/78.80901
Abstract
The basis of an improved form of iterative speech enhancement for single-channel inputs is sequential maximum a posteriori estimation of the speech waveform and its all-pole parameters, followed by imposition of constraints upon the sequence of speech spectra. The approaches impose intraframe and interframe constraints on the input speech signal. Properties of the line spectral pair representation of speech allow for an efficient and direct procedure for application of many of the constraint requirements. Substantial improvement over the unconstrained method is observed in a variety of domains. Informed listener quality evaluation tests and objective speech quality measures demonstrate the technique's effectiveness for additive white Gaussian noise. A consistent terminating point of the iterative technique is shown. The current systems result in substantially improved speech quality and linear predictive coding (LPC) parameter estimation with only a minor increase in computational requirements. The algorithms are evaluated with respect to improving automatic recognition of speech in the presence of additive noise and shown to outperform other enhancement methods in this application.Keywords
This publication has 7 references indexed in Scilit:
- Iterative speech enhancement with spectral constraintsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Line spectrum pair (LSP) and speech data compressionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Constrained iterative speech enhancement with application to automatic speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Objective quality measures applied to enhanced speechThe Journal of the Acoustical Society of America, 1985
- Suppression of acoustic noise in speech using spectral subtractionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- All-pole modeling of degraded speechIEEE Transactions on Acoustics, Speech, and Signal Processing, 1978
- Line spectrum representation of linear predictor coefficients of speech signalsThe Journal of the Acoustical Society of America, 1975