Degraded word recognition based on segmental signal-to-noise ratio weighting

17 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. i (15206149), I/425-I/428
https://doi.org/10.1109/icassp.1994.389265

Abstract

Distance measures that are robust to noise disturbances are required for reliable recognition of noisy speech. The local signal-to-noise ratio (SNR) of degraded speech varies over a wide range, and the characteristics of speech with low SNR tend to be lost. Pattern matching, however, is performed uniformly, without taking the local SNR of each analysis frame into account. The behavior of representative LPC distance measures as a function of segmental SNR is investigated, and the results show that the effect of segmental SNR on the distance measure must be accounted for. A double autocorrelation analysis is proposed as a spectrum estimation method. A pattern matching method is also introduced in which the segmental SNR is used as a weight. Isolated word recognition experiments were performed, and the results show the effectiveness of the proposed method.
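
The idea of weighting pattern matching by segmental SNR can be illustrated with a minimal sketch. The abstract does not give the paper's weighting function, distance measure, or time-alignment procedure, so everything below is an assumption made for illustration: the noise power is taken as known, the weight is a hypothetical linear ramp between two SNR thresholds (`lo`, `hi`), frames are assumed to be already aligned, and a squared Euclidean distance stands in for the LPC distance measures the paper studies.

```python
import math


def segmental_snr_db(frame, noise_power):
    """Per-frame SNR in dB, assuming the noise power is known in advance."""
    power = sum(x * x for x in frame) / len(frame)
    # Subtract the noise power to estimate the speech power; floor it to stay positive.
    signal_power = max(power - noise_power, 1e-12)
    return 10.0 * math.log10(signal_power / noise_power)


def snr_weight(snr_db, lo=0.0, hi=20.0):
    """Hypothetical weight: 0 at or below `lo` dB, 1 at or above `hi` dB, linear between."""
    return min(1.0, max(0.0, (snr_db - lo) / (hi - lo)))


def weighted_distance(test_feats, ref_feats, weights):
    """Weighted mean of per-frame squared Euclidean distances.

    Frames are assumed to be aligned one-to-one; a real recognizer would
    first align them in time.
    """
    total_weight = sum(weights)
    if total_weight == 0.0:
        return float("inf")  # no frame is reliable enough to compare
    total = 0.0
    for t, r, w in zip(test_feats, ref_feats, weights):
        frame_dist = sum((a - b) ** 2 for a, b in zip(t, r))
        total += w * frame_dist
    return total / total_weight
```

In this sketch, frames whose estimated SNR falls below `lo` contribute nothing to the word-level distance, so a noisy low-energy segment cannot dominate the match. The thresholds are placeholders, not values from the paper.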

This publication has 2 references indexed in Scilit:

The short-time modified coherence representation and noisy speech recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
Spectral slope distance measures with linear prediction analysis for word recognition in noise
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987