A segmental speech model with applications to word spotting
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, pp. 447-450
- https://doi.org/10.1109/icassp.1993.319337
Abstract
The authors present a segmental speech model that explicitly models the dynamics in a variable-duration speech segment by using a time-varying trajectory model of the speech features in the segment. Each speech segment is represented by a set of statistics which includes a time-varying trajectory, a residual error covariance around the trajectory, and the number of frames in the segment. These statistics replace the frames in the segment and become the data that are modeled by either HMMs (hidden Markov models) or mixture models. This segment model is used to develop a secondary processing algorithm that rescores putative events hypothesized by a primary HMM word spotter to try to improve performance by discriminating true keywords from false alarms. This algorithm is evaluated on a keyword spotting task using the Road Rally Database, and performance is shown to improve significantly over that of the primary word spotter. The segmental model is also used on a TIMIT vowel classification task to evaluate its modeling capability.
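As a rough illustration of the segment statistics described in the abstract, the sketch below reduces a variable-length segment of feature frames to a trajectory, a residual error covariance, and a frame count. The abstract does not say what form the trajectory takes, so the polynomial fit over normalized time, the function name `segment_statistics`, and the `order` parameter are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np


def segment_statistics(frames, order=2):
    """Summarize a segment as (trajectory coefficients, residual covariance, frame count).

    Assumption: the trajectory is a polynomial in normalized time, fitted by
    least squares. The abstract only says "time-varying trajectory".
    """
    frames = np.asarray(frames, dtype=float)  # shape: (n_frames, n_dims)
    n_frames, n_dims = frames.shape

    # Normalized time in [0, 1], so segments of different durations
    # share a common time axis.
    t = np.linspace(0.0, 1.0, n_frames)

    # Design matrix with columns 1, t, t^2, ..., t^order.
    design = np.vander(t, order + 1, increasing=True)

    # Least-squares fit of one polynomial per feature dimension.
    coeffs, *_ = np.linalg.lstsq(design, frames, rcond=None)

    # Residual error covariance of the frames around the fitted trajectory.
    residual = frames - design @ coeffs
    covariance = residual.T @ residual / n_frames

    return coeffs, covariance, n_frames
```

Under these assumptions, the returned triple would stand in for the raw frames of the segment as the data passed to a downstream HMM or mixture model.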