Robust automatic time alignment of orthographic transcriptions with unconstrained speech
- 1 January 1992
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1 (ISSN 1520-6149), pp. 533-536
- https://doi.org/10.1109/icassp.1992.225853
Abstract
A method for automatic time alignment of orthographically transcribed speech using supervised speaker-independent automatic speech recognition based on the orthographic transcription, an online dictionary, and HMM phone models is presented. This method successfully aligns transcriptions with speech in unconstrained 5 to 10 min conversations collected over long-distance telephone lines. It requires minimal manual processing and generally produces correct alignments despite the challenging nature of the data. The robustness and efficiency of the method make it a practical tool for very large speech corpora.
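The general idea behind transcription-constrained alignment can be illustrated with a toy sketch. This is not the authors' implementation: the paper relies on HMM phone models inside a speech recognizer, whereas the sketch below assumes per-frame phone log-likelihoods are already given and runs a plain Viterbi search over a fixed left-to-right phone sequence. The `LEXICON` entries and the names `words_to_phones`, `forced_align`, and `word_spans` are hypothetical and chosen only for illustration.

```python
import math

# Hypothetical pronunciation dictionary (illustrative entries only).
LEXICON = {"hi": ["HH", "AY"], "you": ["Y", "UW"]}


def words_to_phones(words, lexicon):
    """Expand a word transcription into (phone, word_index) pairs."""
    return [(p, i) for i, w in enumerate(words) for p in lexicon[w]]


def forced_align(frame_scores, phones):
    """Viterbi alignment of a fixed left-to-right phone sequence to frames.

    frame_scores: one dict per frame, mapping phone -> log-likelihood
                  (assumed to come from some acoustic model).
    phones: phone labels in transcript order.
    Returns, for each frame, the index into `phones` assigned to it.
    """
    n_frames, n_states = len(frame_scores), len(phones)
    if n_frames < n_states:
        raise ValueError("need at least one frame per phone")
    best = [[-math.inf] * n_states for _ in range(n_frames)]
    back = [[0] * n_states for _ in range(n_frames)]
    best[0][0] = frame_scores[0][phones[0]]  # path must start in first phone
    for t in range(1, n_frames):
        for s in range(n_states):
            stay = best[t - 1][s]
            advance = best[t - 1][s - 1] if s > 0 else -math.inf
            prev, score = (s - 1, advance) if advance > stay else (s, stay)
            if score > -math.inf:
                best[t][s] = score + frame_scores[t][phones[s]]
                back[t][s] = prev
    # Path must end in the last phone; trace back to recover the alignment.
    path = [n_states - 1]
    for t in range(n_frames - 1, 0, -1):
        path.append(back[t][path[-1]])
    path.reverse()
    return path


def word_spans(path, phone_word_pairs):
    """Collapse a frame-level phone path into (first, last) frames per word."""
    spans = {}
    for t, s in enumerate(path):
        w = phone_word_pairs[s][1]
        start, _ = spans.get(w, (t, t))
        spans[w] = (start, t)
    return [spans[w] for w in sorted(spans)]
```

Given a word transcription, `words_to_phones` supplies the phone sequence, `forced_align` assigns each frame to one phone in that sequence, and `word_spans` reports the first and last frame of each word. A real system would additionally need to handle optional silence, out-of-dictionary words, and pronunciation variants, none of which this sketch attempts.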