CMU robust vocabulary-independent speech recognition system

1 January 1991

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

pp. 889-892, vol. 2
https://doi.org/10.1109/icassp.1991.150482

Abstract

Efforts to improve the performance of CMU's robust vocabulary-independent (VI) speech recognition systems on the DARPA speaker-independent resource management task are discussed. The improvements are evaluated on 320 sentences randomly selected from the DARPA June 88, February 89, and October 89 test sets. The first improvement involves more detailed acoustic modeling: the authors incorporated more dynamic features computed from the LPC cepstra, which reduced error by 15% over the baseline system. The second improvement comes from a larger training database. With more training data available, the third improvement comes from more detailed subword modeling: the authors incorporated word-boundary context into their VI subword modeling, which resulted in a 30% error reduction. Finally, decision-tree allophone clustering was used to find more suitable models for subword units not covered in the training set, which reduced error by a further 17%.
