A comparison of phoneme decision tree (PDT) and context adaptive phone (CAP) based approaches to vocabulary-independent speech recognition

17 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. i (15206149) , I/541-I/544
https://doi.org/10.1109/icassp.1994.389237

Abstract

This paper compares two approaches to context-sensitive phoneme-level hidden Markov modelling for vocabulary-independent automatic speech recognition. The first is an existing method based on the use of binary decision trees to identify equivalence classes of contexts which induce the same effect on the acoustic realisation of a given phoneme. The second is a novel method, called context adaptive phone modelling, which is based on the use of 'context-independent generalised phones-in-context'. In the first method equivalence classes of contexts are derived from a direct analysis of the acoustic patterns, whereas the second approach utilises a symbolic transcription of the training corpus. The paper presents an experimental and methodological comparison of the two methods.<>

Keywords

This publication has 7 references indexed in Scilit:

Context-dependent modeling for acoustic-phonetic recognition of continuous speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
On vocabulary-independent speech modeling
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Recognition of demisyllable based units using semicontinuous hidden Markov models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1992
Decision trees for phonological rules in continuous speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1991
Improved vocabulary-independent sub-word HMM modelling
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1991
Large vocabulary word recognition using context-dependent allophonic hidden Markov models
Computer Speech & Language, 1990
Phoneme-in-context modeling for dragon's continuous speech recognizer
Published by Association for Computational Linguistics (ACL) ,1990