Non-steady state speech analysis method with dynamic feature enhancing effect

24 March 2005

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 7, 1299-1302
https://doi.org/10.1109/icassp.1982.1171639

Abstract

Extraction of speaker-independent features that make it possible to separate /b/ /d/ /g/ or /p/ /t/ /k/ is one of the most difficult problems in automatic speech recognition. The authors propose a new speech analysis method to handle these sounds, which are characterized by high-speed articulations. The method is based on an autoregressive model whose parameters vary linearly with time within the analysis window. A recursive method is proposed that assumes frame-to-frame continuity of each parameter and requires solving simultaneous linear equations with the same number of parameters as in LPC. An articulatory dynamic feature enhancing effect is created by introducing vocal tract reflection coefficients and enhancing the change in vocal tract (acoustic tube) shape between adjacent frames. In experiments on Japanese CV syllables, where C denotes a stop consonant, the proposed method was compared with an LPC-based method, and markedly different results were obtained for the transient parts, especially in labio-dental sounds such as /d/.
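
The model described in the abstract can be illustrated with a minimal sketch, which is not the authors' recursive algorithm. It assumes a predictor of the form x[n] ≈ Σ_k (a_k + b_k·t_n)·x[n−k], with t_n a normalized time running from −1 to 1 across the window, and fits a_k and b_k by ordinary least squares. That gives 2p unknowns per window, whereas the abstract states that the proposed method needs only as many as LPC. The function names, the sign convention, and the time normalization are all assumptions made for illustration.

```python
def solve_linear(A, y):
    """Solve A z = y by Gaussian elimination with partial pivoting."""
    n = len(y)
    M = [row[:] + [y[i]] for i, row in enumerate(A)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("normal equations are singular")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    z = [0.0] * n
    for i in range(n - 1, -1, -1):  # back substitution
        s = sum(M[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (M[i][n] - s) / M[i][i]
    return z


def window_time(n, length):
    """Map sample index n in [0, length - 1] to normalized time in [-1, 1]."""
    return 2.0 * n / (length - 1) - 1.0


def fit_linear_tvar(x, order):
    """Least-squares fit of x[n] ~ sum_k (a[k-1] + b[k-1] * t_n) * x[n-k].

    Returns (a, b): a holds the coefficient values at the window centre
    (t = 0), b holds their slopes per unit of normalized time.
    Hypothetical helper, not taken from the paper.
    """
    N = len(x)
    dim = 2 * order
    R = [[0.0] * dim for _ in range(dim)]  # normal-equation matrix
    r = [0.0] * dim                        # right-hand side
    for n in range(order, N):
        t = window_time(n, N)
        past = [x[n - k] for k in range(1, order + 1)]
        phi = past + [t * v for v in past]  # regressors for a, then for b
        for i in range(dim):
            r[i] += phi[i] * x[n]
            for j in range(dim):
                R[i][j] += phi[i] * phi[j]
    z = solve_linear(R, r)
    return z[:order], z[order:]
```

Calling `fit_linear_tvar(frame, 10)` on a list of samples returns two lists of ten values each. Setting every slope to zero reduces the model to the conventional covariance-style LPC fit, which is the stationary baseline the abstract compares against.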

This publication has 1 reference indexed in Scilit:

Speech Analysis and Synthesis by Linear Prediction of the Speech Wave
The Journal of the Acoustical Society of America, 1971