A dictionary-adaptive speech driven user interface for a distributed multimedia platform
- 1 January 1999
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2 (10896503), 326-332 vol.2
- https://doi.org/10.1109/eurmic.1999.794797
Abstract
The goal of the PRINCESS project is the development of a distributed end-to-end multimedia platform prototype. The platform supports access to media servers in a computer network via various types of mobile terminals. New paradigms for user interfaces are applied in order to make the terminal devices more intelligent and to enable hands-free operation for a busy user. A speech recognition module capable of recognising Finnish words has been integrated into the user interface. The vocabulary of command words is continuously restricted and adapted to the part of the user interface under control. The experimental system has been demonstrated in a distributed digital image retrieval application. Several more conventional user interfaces are available in the application, but the user may also opt to use the speech-driven interface. Experiments show that the use of a dynamic vocabulary greatly reduces recognition errors compared with the full set of command words in our application.
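The idea of restricting the command vocabulary to the part of the user interface under control can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the context names, the Finnish command words, the function names, and above all the use of string similarity (`difflib.SequenceMatcher`) as a stand-in for an acoustic match score. It only shows how the candidate set shrinks when the active context is known.

```python
from difflib import SequenceMatcher

# Hypothetical UI contexts and Finnish command words (illustrative only,
# not the vocabulary used in the PRINCESS system).
CONTEXT_VOCABULARY = {
    "main_menu": ["avaa", "hae", "sulje"],
    "image_viewer": ["seuraava", "edellinen", "tallenna", "sulje"],
}


def full_vocabulary():
    """Return every command word from all contexts, sorted and de-duplicated."""
    words = set()
    for commands in CONTEXT_VOCABULARY.values():
        words.update(commands)
    return sorted(words)


def recognise(heard, candidates):
    """Pick the candidate most similar to the (possibly noisy) input.

    String similarity is a placeholder for an acoustic score; a real
    recogniser would compare feature vectors, not spellings.
    """
    return max(candidates, key=lambda w: SequenceMatcher(None, heard, w).ratio())


def recognise_in_context(heard, context):
    """Restrict the candidates to the commands valid in the active UI context."""
    return recognise(heard, CONTEXT_VOCABULARY[context])


if __name__ == "__main__":
    noisy = "tallena"  # a distorted version of "tallenna"
    print("full vocabulary:", recognise(noisy, full_vocabulary()))
    print("image viewer only:", recognise_in_context(noisy, "image_viewer"))
```

With the dynamic vocabulary, a word that is not valid in the current context can never be returned, so the recogniser has fewer confusable alternatives to choose from. How much this helps in practice depends on the acoustic models and the vocabulary, which this sketch does not model.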
This publication has 6 references indexed in Scilit:
- Multimedia adaptation for dynamic environments. Published by Institute of Electrical and Electronics Engineers (IEEE), 2002
- The role of speech processing in human–computer intelligent communication. Speech Communication, 1997
- Automatic Speech Recognition. Published by Springer Nature, 1989
- A weighted cepstral distance measure for speech recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
- On the Application of Vector Quantization and Hidden Markov Models to Speaker-Independent, Isolated Word Recognition. Bell System Technical Journal, 1983
- Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1980