Automatic labelling of continuous speech with a given phonetic transcription using dynamic programming algorithms

24 March 2005

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 6, 1156-1159
https://doi.org/10.1109/icassp.1981.1171095

Abstract

A system is described which allows the mapping of a phonetic transcription onto an acoustic parameter representation of continuous speech. Linear prediction analysis, segmentation and formant tracking provide the acoustic parameters on a 5 ms time frame basis and a sequence of voiced, unvoiced and silent segments. The given phonetic transcription is expanded to include implicit phone sequences and transitions. Labelling is then performed in two stages. Segment labelling maps substrings of the expanded phone string onto the acoustic segments using a dynamic programming algorithm. The acoustic and phonetic units are correlated directly by means of a table of acoustic-phonetic rules. Frame labelling labels each time frame with a single phone using another dynamic programming algorithm based on the derivatives of energy and formant functions. The method is found to objectify and considerably facilitate the establishment of a time-locked acoustic-phonetic database. Author(s) Wagner, M. Technische Universität München, Federal Republic of Germany

Keywords

This publication has 3 references indexed in Scilit:

Dynamic programming algorithm optimization for spoken word recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1978
Linear Prediction of Speech
Published by Springer Nature ,1976
An algorithm for automatic formant extraction using linear prediction spectra
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1974