Non-steady state speech analysis method with dynamic feature enhancing effect
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 7, 1299-1302
- https://doi.org/10.1109/icassp.1982.1171639
Abstract
Extraction of speaker independent features, which make the separation of /b/ /d/ /g/ or /p/ /t/ /k/ possible, is one of the most difficult problem in automatic speech recognition. The authors propose a new speech analysis method to handle these sounds characterized by high speed articulations. The method is based on an autoregressive model with linearly time variant parameters in the analysis window. Recursive method, which is achieved by solving simultaneous linear equations with same number of parameters as in LPC, is proposed assuming the framewise continuation of each parameter. An articulatory dynamic feature enhancing effect is created by the introduction of vocal tract reflection coefficients and the enhancement of the vocal tract (acoustic tube) shape change between adjacent frames. In experiments on Japanese CV-syllables, where C expresses stops, comparisons have been made between the proposed method and LPC-based method, and widely different results were obtained for the transient parts, especially in labio-dental sounds such as /d/.Keywords
This publication has 1 reference indexed in Scilit:
- Speech Analysis and Synthesis by Linear Prediction of the Speech WaveThe Journal of the Acoustical Society of America, 1971