Speech segmentation using probabilistic phonetic feature hierarchy and support vector machines

2 March 2004

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (10987576) , 675-679
https://doi.org/10.1109/ijcnn.2003.1223445

Abstract

We propose a method that combines a probabilistic phonetic feature hierarchy with support vector machines for segmentation of continuous speech into five classes - vowel, sonorant consonant, fricative, stop and silence. We show that by using the hierarchy, only four binary classifiers are required to recognize the five classes. Due to the probabilistic nature of the hierarchy, the method overcomes the disadvantage of the traditional acoustic-phonetic methods where the error is carried down the hierarchy. In addition, the hierarchical approach allows the use of comparable amount of training data of two classes that each binary classifier is designed to discriminate. The segmentation method with 13 knowledge based parameters performs considerably better than a context-dependent hidden Markov model (HMM) based approach that uses 39 mel-cepstrum based parameters.

Keywords

This publication has 5 references indexed in Scilit:

Segmentation of continuous speech using acoustic-phonetic parameters and statistical learning
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
Support vector machine with dynamic time-alignment kernel for speech recognition
Published by International Speech Communication Association ,2001
Plosive spotting with margin classifiers
Published by International Speech Communication Association ,2001
On the use of support vector machines for phonetic classification
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999
The Nature of Statistical Learning Theory
Published by Springer Nature ,1995