An investigation of PLP and IMELDA acoustic representations and of their potential for combination

1 January 1991

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149,p. 881-884 vol.2
https://doi.org/10.1109/icassp.1991.150480

Abstract

Two acoustic representations, integrated Mel-scale representation with LDA (IMELDA) and perceptual linear prediction-root power sums (PLP-RPS), both of which have given good results in speech recognition tests, are explored. IMELDA is examined in the context of some related representations. Results of speaker-dependent and independent tests with digits and the alphabet suggest that the optimum PLP order is high and that the effectiveness of PLP-RPS stems not from its modeling of perceptual properties but from its approximation to a desirable statistical property attained exactly by IMELDA. A combined PLP-IMELDA representation is found to be generally more effective than PLP-RPS, but an IMELDA representation derived directly from a filter-bank provides similar results to PLP-IMELDA at a lower computational cost.

Keywords

This publication has 8 references indexed in Scilit:

An efficient speaker-independent automatic speech recognition by simulation of some properties of human auditory perception
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Perceptually based linear predictive analysis of speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
The effective second formant F2' and the vocal tract front-cavity
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Speech recognition with continuous-parameter hidden Markov models
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Optimization of perceptually-based ASR front-end (automatic speech recognition)
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Speaker dependent and independent speech recognition experiments with an auditory model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Robust speaker-independent word recognition using static, dynamic and acceleration features: experiments with Lombard and noisy speech
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002