TDNN-LR continuous speech recognition system using adaptive incremental TDNN training
- 1 January 1991
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15206149,p. 53-56 vol.1
- https://doi.org/10.1109/icassp.1991.150276
Abstract
An investigation of speech recognition and language processing is described. The speech recognition part consists of the large phonemic time-delay neural networks (TDNNs) which can automatically spot all 24 Japanese phonemes by simply scanning input speech. The language processing part is made up of a predictive LR parser which predicts subsequent phonemes based on the currently proposed phonemes. This TDNN-LR recognition system provides large-vocabulary and continuous speech recognition. Recognition experiments for ATR's conference registration task were performed using the TDNN-LR method. Speaker-dependent phrase recognition rates of 65.1% for the first choices and 88.8% within the fifth choices were attained. Also, efficiency in the adaptive incremental training using a small number of training tokens extracted from continuous speech was confirmed in the TDNN-LR system.<>Keywords
This publication has 9 references indexed in Scilit:
- Consonant recognition by modular construction of large phonemic time-delay neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Construction of a large-scale Japanese speech database and its management systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Spotting Japanese CV-syllables and phonemes using the time-delay neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- HMM continuous speech recognition using predictive LR parsingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Integrated training for spotting Japanese phonemes using large phonemic time-delay neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Phoneme recognition using time-delay neural networksIEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- Parallelism, hierarchy, scaling in time-delay neural networks for spotting Japanese phonemes CV-syllablesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1989
- Efficient Parsing for Natural LanguagePublished by Springer Nature ,1986
- A level building dynamic time warping algorithm for connected word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1981