Subband based classification of speech under stress

27 November 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (ISSN 1520-6149), pp. 569-572
https://doi.org/10.1109/icassp.1998.674494

Abstract

This study proposes a new set of feature parameters, based on subband analysis of the speech signal, for classifying speech under stress. The new features are scale energy (SE), autocorrelation-scale-energy (ACSE), subband-based cepstral parameters (SC), and autocorrelation-SC (ACSC). Their ability to capture different stress types is compared with that of widely used mel-scale cepstrum-based representations: mel-frequency cepstral coefficients (MFCC) and autocorrelation-mel-scale (AC-mel). Next, a feedforward neural network is formulated for speaker-dependent classification of 10 stress conditions: angry, clear, cond50/70, fast, loud, Lombard, neutral, question, slow, and soft. The classification algorithm is evaluated on a previously established stressed speech database, SUSAS (Hansen and Bou-Ghazale 1997). In closed-vocabulary tests, the subband-based features raise classification rates over the MFCC-based parameters by 7.3% for ungrouped and 9.1% for grouped stress scenarios. Moreover, averaged across the simulations, scores for the new features are 8.6% and 13.6% higher than those for MFCC-based features in the ungrouped and grouped stress scenarios, respectively.
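
The abstract does not give the filter bank or the exact definitions of SE and SC, so the following is only a minimal sketch of the general idea behind subband energy and subband cepstral features. It assumes a three-level Haar wavelet decomposition and a DCT of the log subband energies (by analogy with the final step of MFCC computation); both choices, the function names `haar_step`, `subband_energies`, and `subband_cepstrum`, and the synthetic test frame are hypothetical and are not taken from the paper.

```python
import math

def haar_step(x):
    """One level of an orthonormal Haar analysis: returns (approx, detail)."""
    s = math.sqrt(2.0)
    approx = [(x[i] + x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    detail = [(x[i] - x[i + 1]) / s for i in range(0, len(x) - 1, 2)]
    return approx, detail

def subband_energies(frame, levels=3):
    """Energy of each detail band, followed by the final approximation band."""
    energies = []
    approx = list(frame)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        energies.append(sum(d * d for d in detail))
    energies.append(sum(a * a for a in approx))
    return energies  # ordered from the highest-frequency band to the lowest

def subband_cepstrum(energies, n_coeffs=None, floor=1e-12):
    """DCT-II of the log subband energies (an assumed, MFCC-like final step)."""
    n = len(energies)
    n_coeffs = n if n_coeffs is None else n_coeffs
    log_e = [math.log(max(e, floor)) for e in energies]
    return [
        sum(log_e[k] * math.cos(math.pi * q * (k + 0.5) / n) for k in range(n))
        for q in range(n_coeffs)
    ]

# Hypothetical 64-sample frame: a slow sine plus a weaker, faster component.
frame = [
    math.sin(2 * math.pi * 2 * t / 64) + 0.3 * math.sin(2 * math.pi * 20 * t / 64)
    for t in range(64)
]
energies = subband_energies(frame, levels=3)
cepstrum = subband_cepstrum(energies)
```

Because the Haar step used here is orthonormal and the frame length is divisible by 2^3, the subband energies sum to the total frame energy. In a classifier of the kind the abstract describes, feature vectors like `energies` or `cepstrum` would be computed per frame and fed to the neural network.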

This publication has 10 references indexed in Scilit:

Acoustic-phonetic analysis of loud and Lombard speech in simulated cockpit conditions
Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
Analysis of glottal waveforms across stress styles
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
Wavelet based analysis of speech under stress
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
Subband analysis for robust speech recognition in the presence of car noise
Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
Getting started with SUSAS: a speech under simulated and actual stress database
Published by International Speech Communication Association, 1997
Analysis and compensation of speech under stress and noise for environmental robustness in speech recognition
Speech Communication, 1996
Feature analysis and neural network-based classification of speech under stress
IEEE Transactions on Speech and Audio Processing, 1996
Root cepstral analysis: A unified view. Application to speech processing in car noise environments
Speech Communication, 1993
Wavelets and signal processing
IEEE Signal Processing Magazine, 1991
Digital inverse filtering: a new tool for formant trajectory estimation
IEEE Transactions on Audio and Electroacoustics, 1972