Data driven search organization for continuous speech recognition
- 1 January 1992
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Signal Processing
- Vol. 40 (2), 272-281
- https://doi.org/10.1109/78.124938
Abstract
The authors describe an architecture and search organization for continuous speech recognition. The recognition module is part of the SPICOS system (Siemens-Philips-Ipo project on continuous speech recognition and understanding), which is designed to understand database queries spoken in natural language. The goal of this project is a man-machine dialogue system that can understand fluently spoken German sentences and thus provide voice access to a database. The recognition strategy is based on the Bayes decision rule and attempts to find the best interpretation of the input speech data in terms of knowledge sources such as a language model, a pronunciation lexicon, and an inventory of subword units. The implementation of the search has been tested on a continuous speech database comprising up to 4000 words for each of several speakers. The efficiency and robustness of the search organization have been checked and evaluated along many dimensions, such as different speakers, phoneme models, and language models.
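For orientation, the Bayes decision rule named in the abstract is commonly written in the form below. The notation is an assumption that follows general convention in the speech recognition literature and is not taken from the abstract: $x_1^T$ stands for the sequence of acoustic observations and $w_1^N$ for a candidate word sequence. The paper's own symbols may differ.

```latex
\hat{w}_1^N \;=\; \operatorname*{arg\,max}_{w_1^N}
  \Big\{ \Pr(w_1^N) \cdot \Pr(x_1^T \mid w_1^N) \Big\}
```

In this reading, $\Pr(w_1^N)$ would be supplied by the language model, while $\Pr(x_1^T \mid w_1^N)$ would be built from the pronunciation lexicon and the subword unit models. The search organization is then the procedure that carries out the maximization over word sequences.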