Speech recognition based on top‐down and bottom‐up phoneme recognition
- 1 January 1986
- journal article
- research article
- Published by Wiley in Systems and Computers in Japan
- Vol. 17 (7) , 95-106
- https://doi.org/10.1002/scj.4690170711
Abstract
This paper discusses a speech recognition system which integrates the top‐down and bottom‐up phoneme recognitions. The system is based on the recognition of phonemes, where the top‐down and bottom‐up processings are combined using a table called a blackboard. In top‐down processing, the segmentation and the scoring are performed for each phoneme in the total speech interval, and in the bottom‐up processing, only for the interval in which the phoneme segmentation can be performed with certainty. By this scheme, the two recognition processings cooperate, while maintaining their independence. In the proposed system, the linguistic processing and the acoustic processing are structured hierarchically. The two parts are combined through the blackboard, avoiding duplicated processings in the same environment. To evaluate the constructed system, a spoken word recognition experiment with the word dictionaries composed of 100 or 643 city names, and the continuous speech recognition experiment for 235 minimal phrases uttered by two examinees were performed. It was observed as a result that the recognition performance by the traditional top‐down processing is almost maintained, while the processing time is decreased to one‐half or one‐third in word recognition and less than one‐fourth in minimal phrase recognition.Keywords
This publication has 11 references indexed in Scilit:
- Spoken word recognition based on top-down phoneme segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Two-Level DP-Matching—A Dynamic Programming-Based Pattern Matching Algorithm for Connected Word RecognitionPublished by Elsevier ,1990
- A hierarchical decision approach to large-vocabulary discrete utterance recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1983
- Demisyllable-based isolated word recognition systemIEEE Transactions on Acoustics, Speech, and Signal Processing, 1983
- Isolated Word Recognition for Large VocabulariesBell System Technical Journal, 1982
- Speaker-independent isolated word recognition using a 129-word airline vocabularyThe Journal of the Acoustical Society of America, 1982
- LPC peak weighted spectral matching measuresElectronics and Communications in Japan (Part I: Communications), 1981
- A level building dynamic time warping algorithm for connected word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1981
- Speech Understanding SystemsPublished by Defense Technical Information Center (DTIC) ,1976
- Organization of the Hearsay II speech understanding systemIEEE Transactions on Acoustics, Speech, and Signal Processing, 1975