Speaker adaptation in continuous speech recognition via estimation of correlated mean vectors

1 January 1991

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 15206149,p. 865-868 vol.2
https://doi.org/10.1109/icassp.1991.150475

Abstract

Recent attempts to improve the recognition performance of a semi-continuous version of the CMU SPHINX system (SPHINX-SC) through the use of speaker adaptation are described. The authors' approach to speaker adaptation is to use multivariate parameter estimation procedures to update the mean values of the component densities which comprise the system's codebook, given the speaker-specific observations. The authors have developed a least mean square (LMS) algorithm which produces a faster rate of convergence than the Bayesian estimator, at the expense of a finite misadjustment. This estimate is similar in form to an LMS transversal filter, and is computationally more efficient than the Bayesian estimate. Results show an overall reduction of 2.0 to 3.4% in word error rate due to adaptation for a set of 11 speakers from the DARPA resource management task.

Keywords

This publication has 6 references indexed in Scilit:

Some statistical issues in the comparison of speech recognition algorithms
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
On semi-continuous hidden Markov modeling
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
A study on speaker adaptation of continuous density HMM parameters
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
An overview of the SPHINX speech recognition system
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1990
Dynamic speaker adaptation for feature-based isolated word recognition
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987
A Posteriori Estimation of Correlated Jointly Gaussian Mean Vectors
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1984