Speaker independent phonetic transcription of fluent speech for large vocabulary speech recognition

13 January 2003

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149,p. 441-444
https://doi.org/10.1109/icassp.1989.266458

Abstract

Results are presented of experiments on speaker independent phonetic transcription of fluent speech. The acoustic-phonetic model is a 38505-parameter continuously variable duration hidden Markov model which allows real-time phonetic transcription to be performed by means of a modified Viterbi algorithm. The model was trained on 3020 sentences from the TIMIT database. Testing was performed on the remaining 180 sentences. In a test without lexical or syntactic constraints, the authors obtained 52% correct phonetic transcription with 12% insertions. The design of a system for recognition of fluent speech based on the technique for phonetic transcription is described.

Keywords

This publication has 7 references indexed in Scilit:

Continuous speech recognition by means of acoustic/ Phonetic classification obtained from a hidden Markov model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
On the use of instantaneous and transitional spectral information in speaker recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
The DARPA 1000-word resource management database for continuous speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Segmental durations in connected-speech signals: Current results
The Journal of the Acoustical Society of America, 1988
On the use of bandpass liftering in speech recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
Continuously variable duration hidden Markov models for automatic speech recognition
Computer Speech & Language, 1986
Minimum prediction residual principle applied to speech recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1975