Speaker-independent isolated word recognition for telephone voice using phoneme-like templates
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 11, 2687-2690
- https://doi.org/10.1109/icassp.1986.1168583
Abstract
This paper describes a speaker-independent isolated word recognition algorithm for telephone voice and its recognition performance. The recognition algorithm consists of two processes ; dynamic time warping and statistical word discrimination. In the first process, input speech is compared with each word template using the dynamic time warping technique. Multiple word templates are used to deal with speech variations among speakers, where each word template is represented by a sequence of phoneme-like templates. To attain high recognition ability, a new technique for generating word templates is proposed. In the second process, statistical word discrimination is carried out for word candidates which have relatively low reliability in the first process. Discrimination functions are calculated based on statistics of transition tendencies of speech characteristics between adjacent frames, and the final word decision is made. The system was trained using utterances from 1305 speakers and tested with utterances from 259 speakers. The average recognition rate of 96.5% was obtained for a 16-word Japanese vocabulary set.Keywords
This publication has 5 references indexed in Scilit:
- Speaker-independent isolated word recognition for telephone voice using phoneme-like templatesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Isolated word recognition using phoneme-like templatesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Speaker‐independent isolated word recognition based on multiple templates using split methodSystems and Computers in Japan, 1985
- An Algorithm for Vector Quantizer DesignIEEE Transactions on Communications, 1980
- Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979