Robust automatic time alignment of orthographic transcriptions with unconstrained speech

1 January 1992

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (ISSN 1520-6149), pp. 533-536
https://doi.org/10.1109/icassp.1992.225853

Abstract

This paper presents a method for automatic time alignment of orthographically transcribed speech. The method uses supervised, speaker-independent automatic speech recognition based on the orthographic transcription, an online dictionary, and HMM phone models. It successfully aligns transcriptions with speech in unconstrained 5- to 10-minute conversations collected over long-distance telephone lines. The method requires minimal manual processing and generally produces correct alignments despite the challenging nature of the data. Its robustness and efficiency make it a practical tool for very large speech corpora.
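The pipeline the abstract describes (transcription, then dictionary lookup, then alignment against phone models) can be illustrated with a minimal sketch. This is not the authors' implementation: the lexicon, the function names `words_to_phones` and `forced_align`, and the hand-made score matrix are all hypothetical, and each phone is reduced to a single state, whereas real HMM phone models typically have several states and scores computed from audio features.

```python
from typing import Dict, List, Tuple

NEG_INF = float("-inf")


def words_to_phones(words: List[str], lexicon: Dict[str, List[str]]) -> List[Tuple[str, str]]:
    """Expand a word sequence into (word, phone) pairs via dictionary lookup.

    `lexicon` is a hypothetical pronunciation dictionary keyed by lowercase word.
    """
    pairs = []
    for word in words:
        for phone in lexicon[word.lower()]:
            pairs.append((word, phone))
    return pairs


def forced_align(scores: List[List[float]]) -> List[Tuple[int, int]]:
    """Viterbi alignment of frames to a fixed left-to-right phone sequence.

    scores[t][j] is an assumed log-likelihood of frame t under the j-th phone
    of the expected sequence. Every phone must cover at least one frame.
    Returns one (start_frame, end_frame) pair per phone, both inclusive.
    """
    n_frames, n_phones = len(scores), len(scores[0])
    if n_frames < n_phones:
        raise ValueError("need at least one frame per phone")

    best = [[NEG_INF] * n_phones for _ in range(n_frames)]
    advanced = [[False] * n_phones for _ in range(n_frames)]
    best[0][0] = scores[0][0]  # the path must start in the first phone

    for t in range(1, n_frames):
        for j in range(n_phones):
            stay = best[t - 1][j]
            advance = best[t - 1][j - 1] if j > 0 else NEG_INF
            if advance > stay:
                best[t][j] = advance + scores[t][j]
                advanced[t][j] = True
            else:
                best[t][j] = stay + scores[t][j]

    # Backtrace from the last phone at the last frame.
    starts = [0] * n_phones
    ends = [0] * n_phones
    j = n_phones - 1
    ends[j] = n_frames - 1
    for t in range(n_frames - 1, 0, -1):
        if advanced[t][j]:
            starts[j] = t
            j -= 1
            ends[j] = t - 1
    return list(zip(starts, ends))


# Toy example: one word, two phones, five frames of made-up scores.
toy_lexicon = {"hi": ["HH", "AY"]}
toy_phones = words_to_phones(["Hi"], toy_lexicon)
toy_scores = [[0.0, -5.0], [0.0, -5.0], [-5.0, 0.0], [-5.0, 0.0], [-5.0, 0.0]]
toy_alignment = forced_align(toy_scores)
```

Because the transcription fixes the phone order in advance, the search only has to decide where each boundary falls, which is one reason this kind of supervised alignment can be far cheaper than unconstrained recognition. Frame indices convert to times by multiplying by the frame shift of the acoustic front end, a value the abstract does not state.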
