Automatic labelling of continuous speech with a given phonetic transcription using dynamic programming algorithms
- 24 March 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 6, 1156-1159
- https://doi.org/10.1109/icassp.1981.1171095
Abstract
A system is described which allows the mapping of a phonetic transcription onto an acoustic parameter representation of continuous speech. Linear prediction analysis, segmentation and formant tracking provide the acoustic parameters on a 5 ms time frame basis and a sequence of voiced, unvoiced and silent segments. The given phonetic transcription is expanded to include implicit phone sequences and transitions. Labelling is then performed in two stages. Segment labelling maps substrings of the expanded phone string onto the acoustic segments using a dynamic programming algorithm. The acoustic and phonetic units are correlated directly by means of a table of acoustic-phonetic rules. Frame labelling labels each time frame with a single phone using another dynamic programming algorithm based on the derivatives of energy and formant functions. The method is found to objectify and considerably facilitate the establishment of a time-locked acoustic-phonetic database.
Author(s): Wagner, M., Technische Universität München, Federal Republic of Germany
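The segment-labelling stage named in the abstract (mapping substrings of the phone string onto voiced, unvoiced and silent segments by dynamic programming) can be illustrated with a minimal sketch. This is not the paper's algorithm or rule table: the `RULES` dictionary, the 0/1 mismatch cost, the segment codes `"V"`, `"U"`, `"S"`, and the requirement that every segment receives at least one phone are all assumptions made purely for illustration.

```python
from math import inf

# Hypothetical acoustic-phonetic rule table (an assumption, not from the paper):
# for each phone, the segment types it may plausibly occupy.
# "V" = voiced, "U" = unvoiced, "S" = silent.
RULES = {
    "s": {"U"},
    "a": {"V"},
    "m": {"V"},
    "t": {"S", "U"},
}


def mismatch(phone, seg_type):
    """Illustrative cost: 0 if the rule table allows the pairing, else 1."""
    return 0 if seg_type in RULES.get(phone, set()) else 1


def label_segments(phones, segments):
    """Assign a contiguous, non-empty substring of `phones` to each segment,
    minimising the total mismatch cost by dynamic programming."""
    n_seg, n_ph = len(segments), len(phones)
    # cost[i][j]: best cost of mapping the first j phones onto the first i segments
    cost = [[inf] * (n_ph + 1) for _ in range(n_seg + 1)]
    back = [[0] * (n_ph + 1) for _ in range(n_seg + 1)]
    cost[0][0] = 0
    for i in range(1, n_seg + 1):
        for j in range(i, n_ph + 1):
            # phones[k:j] is the substring given to segment i
            for k in range(i - 1, j):
                c = cost[i - 1][k] + sum(
                    mismatch(p, segments[i - 1]) for p in phones[k:j]
                )
                if c < cost[i][j]:
                    cost[i][j] = c
                    back[i][j] = k
    if cost[n_seg][n_ph] == inf:
        raise ValueError("fewer phones than segments; no alignment exists")
    # Trace back the chosen substring boundaries from the last segment.
    labelled = []
    j = n_ph
    for i in range(n_seg, 0, -1):
        k = back[i][j]
        labelled.append((segments[i - 1], phones[k:j]))
        j = k
    labelled.reverse()
    return cost[n_seg][n_ph], labelled


total, labelled = label_segments(["s", "a", "m", "t", "a"], ["U", "V", "S", "V"])
for seg_type, substring in labelled:
    print(seg_type, substring)
```

A real system would replace the 0/1 cost with scores derived from the acoustic-phonetic rules and the measured parameters; the second stage (frame labelling) would then run a separate dynamic programme within each segment.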
This publication has 3 references indexed in Scilit:
- Dynamic programming algorithm optimization for spoken word recognition. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1978
- Linear Prediction of Speech. Published by Springer Nature, 1976
- An algorithm for automatic formant extraction using linear prediction spectra. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1974