A comparison of phoneme decision tree (PDT) and context adaptive phone (CAP) based approaches to vocabulary-independent speech recognition
- 17 December 2002
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. i (15206149) , I/541-I/544
- https://doi.org/10.1109/icassp.1994.389237
Abstract
This paper compares two approaches to context-sensitive phoneme-level hidden Markov modelling for vocabulary-independent automatic speech recognition. The first is an existing method based on the use of binary decision trees to identify equivalence classes of contexts which induce the same effect on the acoustic realisation of a given phoneme. The second is a novel method, called context adaptive phone modelling, which is based on the use of 'context-independent generalised phones-in-context'. In the first method equivalence classes of contexts are derived from a direct analysis of the acoustic patterns, whereas the second approach utilises a symbolic transcription of the training corpus. The paper presents an experimental and methodological comparison of the two methods.<>Keywords
This publication has 7 references indexed in Scilit:
- Context-dependent modeling for acoustic-phonetic recognition of continuous speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- On vocabulary-independent speech modelingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Recognition of demisyllable based units using semicontinuous hidden Markov modelsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- Decision trees for phonological rules in continuous speechPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Improved vocabulary-independent sub-word HMM modellingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Large vocabulary word recognition using context-dependent allophonic hidden Markov modelsComputer Speech & Language, 1990
- Phoneme-in-context modeling for dragon's continuous speech recognizerPublished by Association for Computational Linguistics (ACL) ,1990