Japanese text input system based on continuous speech recognition

Abstract
A Japanese text input system using continuous speech recognition is described. This system is composed of two major parts, acoustic processing and linguistic processing. Consonant-Vowel (CV) lattice is generated by acoustic processing from input speech uttered phrase by phrase. CV syllables in continuous speech are detected by continuous dynamic programming (DP) based on SPLIT (Strings of Phoneme-LIke Templates) method. Multiple CV templates extracted from training speech data are used to improve detection accuracy. In linguistic processing, CV lattice is converted into written form using a word dictionary. CV recognition accuracy of 67 % and Japanese translation rate of 53 % are obtained as an experimental result.

This publication has 3 references indexed in Scilit: