Signal conditioning techniques for robust speech recognition

1 April 1996

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Signal Processing Letters

Vol. 3 (4) , 107-109
https://doi.org/10.1109/97.489062

Abstract

Acoustic mismatch encountered in various training and testing conditions of hidden Markov model (HMM) based systems often causes severe degradation in speech recognition performance. For telephone based speech recognition tasks, acoustic mismatch can arise from various sources, such as variations in telephone handsets, ambient noise, and channel distortions. This paper presents three techniques for blind channel equalization, namely, cepstral mean subtraction (CMS), signal bias removal (SBR) and hierarchical signal bias removal (HSBR). Experimental results on various connected digits databases show a reduction in the digit error rate by 16%, 21%, and 28% when employing CMS, SBR, and HSBR, respectively. Our results also demonstrate that the HSBR technique outperforms SBR and CMS on every sub-data collection and exhibits consistent improvements even for short utterances.

Keywords

This publication has 4 references indexed in Scilit:

Signal bias removal for robust telephone based speech recognition in adverse environments
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Context-dependent acoustic subword modeling for connected digit recognition
The Journal of the Acoustical Society of America, 1993
Noise adaptation algorithms for robust speech recognition
Speech Communication, 1993
Speaker-independent isolated word recognition using dynamic features of speech spectrum
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1986