Speech recognition based on top‐down and bottom‐up phoneme recognition

1 January 1986

journal article
research article
Published by Wiley in Systems and Computers in Japan

Vol. 17 (7), 95-106
https://doi.org/10.1002/scj.4690170711

Abstract

This paper discusses a speech recognition system that integrates top‐down and bottom‐up phoneme recognition. The system is based on phoneme recognition, and the top‐down and bottom‐up processes are combined through a table called a blackboard. In top‐down processing, segmentation and scoring are performed for each phoneme over the entire speech interval; in bottom‐up processing, they are performed only for intervals in which phoneme segmentation can be carried out with certainty. Under this scheme, the two recognition processes cooperate while remaining independent. In the proposed system, linguistic processing and acoustic processing are structured hierarchically. The two parts are connected through the blackboard, which avoids duplicated processing in the same environment. To evaluate the system, two experiments were performed: spoken word recognition with word dictionaries of 100 or 643 city names, and continuous speech recognition of 235 minimal phrases uttered by two speakers. The results show that the recognition performance of traditional top‐down processing is almost maintained, while processing time is reduced to one‐half or one‐third in word recognition and to less than one‐fourth in minimal phrase recognition.
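
The role of the blackboard described in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: the class `Blackboard`, the functions `toy_scorer` and `score_word`, the keying by (phoneme, start frame, end frame), and the scoring formula are all hypothetical, chosen only to show how a shared table lets a bottom‐up pass post results for intervals it is confident about while a top‐down pass reuses them instead of rescoring.

```python
from typing import Callable, Dict, List, Tuple

# Hypothetical key: (phoneme label, start frame, end frame).
Key = Tuple[str, int, int]
Scorer = Callable[[str, int, int], float]


class Blackboard:
    """Shared table of phoneme-segment scores (illustrative assumption)."""

    def __init__(self) -> None:
        self._scores: Dict[Key, float] = {}
        self.computed = 0  # how many times the acoustic scorer actually ran

    def post(self, phoneme: str, start: int, end: int, score: float) -> None:
        # Bottom-up side: record a score for an interval segmented with certainty.
        self._scores[(phoneme, start, end)] = score

    def lookup_or_score(self, phoneme: str, start: int, end: int,
                        scorer: Scorer) -> float:
        # Top-down side: reuse an existing entry, otherwise score and store it.
        key = (phoneme, start, end)
        if key not in self._scores:
            self._scores[key] = scorer(phoneme, start, end)
            self.computed += 1
        return self._scores[key]


def toy_scorer(phoneme: str, start: int, end: int) -> float:
    # Placeholder for acoustic scoring; a real system would compare features.
    return -float(end - start)


def score_word(board: Blackboard, segments: List[Key], scorer: Scorer) -> float:
    # Top-down scoring of one word hypothesis as a sum of segment scores.
    return sum(board.lookup_or_score(p, s, e, scorer) for p, s, e in segments)
```

In this sketch, a segment posted by the bottom‐up pass is never rescored, and a segment scored once for one word hypothesis is reused by every later hypothesis that contains it. That reuse is one plausible reading of how a blackboard could avoid duplicated processing; the actual mechanism and the source of the reported speed‐up are not specified in the abstract.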
