TDNN-LR continuous speech recognition system using adaptive incremental TDNN training

1 January 1991

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149, pp. 53-56, vol. 1
https://doi.org/10.1109/icassp.1991.150276

Abstract

An investigation of speech recognition and language processing is described. The speech recognition part consists of large phonemic time-delay neural networks (TDNNs), which can automatically spot all 24 Japanese phonemes by simply scanning the input speech. The language processing part is a predictive LR parser, which predicts subsequent phonemes based on the currently proposed phonemes. This TDNN-LR recognition system provides large-vocabulary continuous speech recognition. Recognition experiments for ATR's conference registration task were performed using the TDNN-LR method. Speaker-dependent phrase recognition rates of 65.1% for the first choice and 88.8% within the top five choices were attained. The efficiency of adaptive incremental training using a small number of training tokens extracted from continuous speech was also confirmed in the TDNN-LR system.
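
The interplay the abstract describes, in which a parser predicts which phonemes may come next and the phoneme spotter's scores are consulted only for those predictions, can be illustrated with a minimal sketch. Everything here is an assumption for illustration and not the paper's implementation: the toy vocabulary `PHRASES`, the helper names `predict_next` and `recognize`, and the hand-written score dictionaries standing in for TDNN outputs are all hypothetical. A simple prefix lookup stands in for the LR parsing table, and the input is assumed to be already segmented into one score set per phoneme, whereas a real system must also handle time alignment.

```python
import math

# Hypothetical toy vocabulary (not from the paper): phrases as phoneme sequences.
PHRASES = {
    "kaigi": ["k", "a", "i", "g", "i"],
    "kaisha": ["k", "a", "i", "sh", "a"],
    "namae": ["n", "a", "m", "a", "e"],
}


def predict_next(prefix):
    """Stand-in for the predictive parser: phonemes that may follow `prefix`.

    A prefix lookup over PHRASES replaces a real LR parsing table here.
    """
    n = len(prefix)
    return {
        seq[n]
        for seq in PHRASES.values()
        if len(seq) > n and seq[:n] == list(prefix)
    }


def recognize(scores, beam=5):
    """Rank complete phrases given per-segment phoneme scores.

    `scores` is a list of dicts mapping phoneme -> probability, one dict per
    segment (an assumed stand-in for phoneme-spotting network outputs).
    Returns a list of (phoneme_tuple, log_probability), best first.
    """
    hypotheses = [((), 0.0)]
    for segment in scores:
        extended = []
        for prefix, logp in hypotheses:
            # Only phonemes predicted by the grammar are verified against
            # the acoustic scores; everything else is never considered.
            for phoneme in predict_next(prefix):
                p = segment.get(phoneme, 1e-6)  # small floor for unseen phonemes
                extended.append((prefix + (phoneme,), logp + math.log(p)))
        # Beam pruning keeps the `beam` best partial hypotheses.
        hypotheses = sorted(extended, key=lambda h: h[1], reverse=True)[:beam]
    # Keep only hypotheses that spell a complete phrase.
    return [(pre, lp) for pre, lp in hypotheses if list(pre) in PHRASES.values()]
```

Calling `recognize` with five score dictionaries returns the surviving phrases ordered by accumulated log probability, which mirrors how the first-choice and within-top-five rates in the abstract would be read off a ranked list.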
