Subband based classification of speech under stress
- 27 November 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1 (15206149) , 569-572
- https://doi.org/10.1109/icassp.1998.674494
Abstract
This study proposes a new set of feature parameters based on subband analysis of the speech signal for classification of speech under stress. The new speech features are scale energy (SE), autocorrelation-scale-energy (ACSE), subband based cepstral parameters (SC), and autocorrelation-SC (ACSC). The parameters' ability to capture different stress types is compared to widely used mel-scale cepstrum based representations: mel-frequency cepstral coefficients (MFCC) and autocorrelation-mel-scale (AC-mel). Next, a feedforward neural network is formulated for speaker-dependent stress classification of 10 stress conditions: angry, clear, cond50/70, fast, loud, lombard, neutral, question, slow, and soft. The classification algorithm is evaluated using a previously established stressed speech database (SUSAS) (Hansen and Bou-Ghazale 1997). Subband based features are shown to achieve +7.3% and +9.1% increase in the classification rates over the MFCC based parameters for ungrouped and grouped stress closed vocabulary test scenarios respectively. Moreover the average scores across the simulations of new features are +8.6% and +13.6% higher than MFCC based features for the ungrouped and grouped stress test scenarios respectively.Keywords
This publication has 10 references indexed in Scilit:
- Acoustic-phonetic analysis of loud and Lombard speech in simulated cockpit conditionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Analysis of glottal waveforms across stress stylesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Wavelet based analysis of speech under stressPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Subband analysis for robust speech recognition in the presence of car noisePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Getting started with SUSAS: a speech under simulated and actual stress databasePublished by International Speech Communication Association ,1997
- Analysis and compensation of speech under stress and noise for environmental robustness in speech recognitionSpeech Communication, 1996
- Feature analysis and neural network-based classification of speech under stressIEEE Transactions on Speech and Audio Processing, 1996
- Root cepstral analysis: A unified view. Application to speech processing in car noise environmentsSpeech Communication, 1993
- Wavelets and signal processingIEEE Signal Processing Magazine, 1991
- Digital inverse filtering-a new tool for formant trajectory estimationIEEE Transactions on Audio and Electroacoustics, 1972