Voice Activity Detection Based on Wavelet Packet Transform in Communication Nonlinear Channel
- 1 July 2009
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
Abstract
This paper presents a voice activity detection (VAD) algorithm based on the Wavelet Packet Transform and the Teager Energy Operator (TEO). The signal is first decomposed into subband signals, using the multi-resolution analysis property of the wavelet transform to extract and analyse the time-frequency components corresponding to speech. TEO processing is then applied to each subband signal to better distinguish the subbands that contain speech. The variances of the TEO-processed subband signals are summed to obtain a parameter called the Voice Activity Shape (VAS), which is higher in speech regions than in non-speech regions. Experimental results show that the proposed VAD performs better than G.729B, particularly in difficult noise conditions and when the speech signal passes through a nonlinear communication channel. Results are also reported for real speech communications from a spaceship to a terrestrial 3G cellular network, assuming nonlinear interference.
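The processing chain described in the abstract (wavelet packet decomposition, TEO on each subband, sum of subband variances) can be illustrated with a minimal sketch. It is not the authors' implementation: the Haar wavelet, the decomposition depth of 3, the 256-sample frame at 8 kHz, the use of the population variance, and all function names are assumptions chosen for illustration, since the abstract does not specify them. The TEO used here is the standard discrete form, psi[n] = x[n]^2 - x[n-1]*x[n+1]. The decision rule that turns the VAS into a speech/non-speech label is not covered.

```python
import math
import random
from statistics import pvariance


def haar_split(x):
    """One Haar analysis step: return (low-pass, high-pass) halves of x."""
    s = 1.0 / math.sqrt(2.0)
    idx = range(0, len(x) - 1, 2)
    low = [(x[i] + x[i + 1]) * s for i in idx]
    high = [(x[i] - x[i + 1]) * s for i in idx]
    return low, high


def wavelet_packet_subbands(frame, levels):
    """Full wavelet packet tree: every node is split, giving 2**levels subbands."""
    bands = [list(frame)]
    for _ in range(levels):
        bands = [half for band in bands for half in haar_split(band)]
    return bands


def teager_energy(x):
    """Discrete Teager energy operator: psi[n] = x[n]^2 - x[n-1]*x[n+1]."""
    return [x[n] ** 2 - x[n - 1] * x[n + 1] for n in range(1, len(x) - 1)]


def voice_activity_shape(frame, levels=3):
    """Sum, over all subbands, of the variance of the TEO-processed subband."""
    return sum(
        pvariance(teager_energy(band))
        for band in wavelet_packet_subbands(frame, levels)
    )


# Synthetic demo (assumed test signals, not data from the paper).
rng = random.Random(0)
fs, n_samples = 8000, 256
noise = [0.01 * rng.gauss(0.0, 1.0) for _ in range(n_samples)]
# Three harmonics of 200 Hz stand in for a voiced speech frame.
tone = [
    sum(0.5 / k * math.sin(2 * math.pi * 200 * k * n / fs) for k in (1, 2, 3))
    for n in range(n_samples)
]
speech_like = [t + w for t, w in zip(tone, noise)]

print("VAS, noise only :", voice_activity_shape(noise))
print("VAS, speech-like:", voice_activity_shape(speech_like))
```

In this toy setup the speech-like frame yields a larger VAS than the noise-only frame, which matches the behaviour the abstract describes. A real detector would use a longer wavelet filter and compare the VAS against a threshold, both of which are outside what the abstract states.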