An investigation of PLP and IMELDA acoustic representations and of their potential for combination
- 1 January 1991
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 15206149, pp. 881-884, vol. 2
- https://doi.org/10.1109/icassp.1991.150480
Abstract
Two acoustic representations, integrated Mel-scale representation with LDA (IMELDA) and perceptual linear prediction-root power sums (PLP-RPS), both of which have given good results in speech recognition tests, are explored. IMELDA is examined in the context of some related representations. Results of speaker-dependent and independent tests with digits and the alphabet suggest that the optimum PLP order is high and that the effectiveness of PLP-RPS stems not from its modeling of perceptual properties but from its approximation to a desirable statistical property attained exactly by IMELDA. A combined PLP-IMELDA representation is found to be generally more effective than PLP-RPS, but an IMELDA representation derived directly from a filter-bank provides similar results to PLP-IMELDA at a lower computational cost.
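As a rough illustration of what "an IMELDA representation derived directly from a filter-bank" could look like, the sketch below applies a textbook linear discriminant analysis (LDA) to filter-bank-style feature vectors. It is not taken from the paper: the function name `lda_transform`, the class labels, the synthetic data, and the dimensions are all hypothetical. With the normalization used here, the pooled within-class covariance of the projected features is the identity matrix; whether this is the statistical property the abstract refers to is an assumption.

```python
import numpy as np


def lda_transform(features, labels, n_components):
    """Return a (dim, n_components) LDA projection matrix.

    features: (n_frames, dim) array, standing in for log mel filter-bank
              energies (hypothetical input, not the paper's data).
    labels:   (n_frames,) integer class labels (hypothetical; the abstract
              does not say how classes were defined).
    """
    features = np.asarray(features, dtype=float)
    labels = np.asarray(labels)
    global_mean = features.mean(axis=0)
    dim = features.shape[1]

    within = np.zeros((dim, dim))   # pooled within-class covariance
    between = np.zeros((dim, dim))  # between-class covariance
    for c in np.unique(labels):
        x = features[labels == c]
        mean_c = x.mean(axis=0)
        centered = x - mean_c
        within += centered.T @ centered
        diff = (mean_c - global_mean)[:, None]
        between += len(x) * (diff @ diff.T)
    within /= len(features)
    between /= len(features)

    # Step 1: whiten the within-class covariance (assumes it is full rank).
    s, u = np.linalg.eigh(within)
    whiten = u / np.sqrt(s)  # scales column j of u by 1 / sqrt(s[j])

    # Step 2: diagonalize the between-class covariance in the whitened
    # space and keep the directions with the largest eigenvalues.
    evals, evecs = np.linalg.eigh(whiten.T @ between @ whiten)
    order = np.argsort(evals)[::-1][:n_components]
    return whiten @ evecs[:, order]


# Synthetic stand-in data: 3 classes, 200 frames each, 6 "channels".
rng = np.random.default_rng(0)
labels = np.repeat(np.arange(3), 200)
class_means = rng.normal(size=(3, 6)) * 2.0
noise_scale = np.array([1.0, 2.0, 0.5, 1.5, 1.0, 3.0])
features = class_means[labels] + rng.normal(size=(600, 6)) * noise_scale

proj = lda_transform(features, labels, n_components=2)
projected = features @ proj  # (600, 2) discriminant features
```

The two-step form (whiten, then rotate) is chosen only because it makes the identity within-class covariance easy to see; solving the generalized eigenproblem in one call would give an equivalent projection up to sign.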