A robust voice activity detector for wireless communications using soft computing
- 1 January 1998
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Journal on Selected Areas in Communications
- Vol. 16 (9) , 1818-1829
- https://doi.org/10.1109/49.737650
Abstract
Discontinuous transmission based on speech/pause detection represents a valid solution to improve the spectral efficiency of new generation wireless communication systems. In this context, robust voice activity detection (VAD) algorithms are required, as traditional solutions present a high misclassification rate in the presence of the background noise typical of mobile environments. This paper presents a voice detection algorithm which is robust to noisy environments, thanks to a new methodology adopted for the matching process. More specifically, the VAD proposed is based on a pattern recognition approach in which the matching phase is performed by a set of six fuzzy rules, trained by means of a new hybrid learning tool. A series of objective tests performed on a large speech database, varying the signal-to-noise ratio (SNR), the types of background noise, and the input signal level, showed that, as compared with the VAD standardized by ITU-T in Recommendation G.729 annex B, the fuzzy VAD, on average, achieves an improvement in reduction both of the activity factor of about 25% and of the clipping introduced of about 43%. Informal listening tests also confirm an improvement in the perceived speech qualityKeywords
This publication has 20 references indexed in Scilit:
- Voice activity detection for cellular networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Multilevel Speech Classification Based on Fuzzy LogicPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A genetic approach to fuzzy learningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Requirements on speech coders imposed by speech service solutions in cellular systemsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- FuGeNeSys-a fuzzy genetic neural system for fuzzy modelingIEEE Transactions on Fuzzy Systems, 1998
- ITU-T Recommendation G.729 Annex B: a silence compression scheme for use with G.729 optimized for V.70 digital simultaneous voice and data applicationsIEEE Communications Magazine, 1997
- Genetic algorithms and their applicationsIEEE Signal Processing Magazine, 1996
- Fuzzy logic, neural networks, and soft computingCommunications of the ACM, 1994
- Multiuser rate subband coding incorporating DSI and buffer controlIEEE Transactions on Communications, 1990
- Discrete-Time Analysis of Integrated Voice/Data Multiplexers With and Without Speech Activity DetectorsIEEE Journal on Selected Areas in Communications, 1983