CMU robust vocabulary-independent speech recognition system
- 1 January 1991
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 889-892 vol.2
- https://doi.org/10.1109/icassp.1991.150482
Abstract
Efforts to improve the performance of CMU's robust vocabulary-independent (VI) speech recognition systems on the DARPA speaker-independent resource management task are discussed. The improvements are evaluated on 320 sentences randomly selected from the DARPA June 88, February 89, and October 89 test sets. The first improvement involves more detailed acoustic modeling. The authors incorporated more dynamic features computed from the LPC cepstra and reduced error by 15% over the baseline system. The second improvement comes from a larger database. With more training data, the third improvement comes from a more detailed subword modeling. The authors incorporated the word boundary context into their VI subword modeling and it resulted in a 30% error reduction. Decision-tree allophone clustering was used to find more suitable models for the subword units not covered in the training set and further reduced error by 17%.Keywords
This publication has 10 references indexed in Scilit:
- Phoneme environment clustering for speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Allophone clustering for continuous speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- On vocabulary-independent speech modelingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Improved acoustic modeling with the SPHINX speech recognition systemPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Context-independent phonetic hidden Markov models for speaker-independent continuous speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1990
- Improved hidden Markov modeling for speaker-independent continuous speech recognitionPublished by Association for Computational Linguistics (ACL) ,1990
- An overview of the SPHINX speech recognition systemIEEE Transactions on Acoustics, Speech, and Signal Processing, 1990
- A tree-based statistical language model for natural language speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1989
- Towards speech recognition without vocabulary-specific trainingPublished by Association for Computational Linguistics (ACL) ,1989
- Speaker-independent isolated word recognition using dynamic features of speech spectrumIEEE Transactions on Acoustics, Speech, and Signal Processing, 1986