A new speaker adaptation technique using very short calibration speech

1 January 1993

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2, 562-565 vol.2
https://doi.org/10.1109/icassp.1993.319369

Abstract

A speaker adaptation technique based on the separation of speech spectra variation sources is developed for improving speaker-independent continuous speech recognition. The variation sources include speaker acoustic characteristics, phonologic characteristics, and contextual dependency of allophones. Statistical methods are formulated to normalize speech spectra based on speaker acoustic characteristics and then adapt mixture Gaussian density phone models based on speaker phonologic characteristics. Adaptation experiments using short calibration speech (5 s/speaker) have shown substantial performance improvement over the baseline recognition system. On a TIMIT test set, where the task vocabulary size is 853 and the test set perplexity is 104, the recognition word accuracy has been improved from 86.9% to 90.6% (28.2% error reduction). On a separate test set which contains an additional variation source of recording channel mismatch and with the test set perplexity of 101, the recognition word accuracy has been improved from 65.4% to 85.5% (58.1% error reduction).<>

Keywords

This publication has 12 references indexed in Scilit:

Unsupervised speaker adaptation by probabilistic spectrum fitting
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Unsupervised speaker adaptation method based on hierarchical spectral clustering
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
A study on speaker adaptation of continuous density HMM parameters
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
A speaker-independent continuous speech recognition system using continuous mixture Gaussian density HMM of phoneme-sized units
IEEE Transactions on Speech and Audio Processing, 1993
A Bayesian approach to speaker adaptation for the stochastic segment model
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1992
An LVQ based reference model for speaker-adaptive speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1992
On speaker-independent, speaker-dependent, and speaker-adaptive speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1991
Speaker adaptation in continuous speech recognition via estimation of correlated mean vectors
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1991
Vowel normalization by frequency warped spectral matching
Speech Communication, 1986
Speaker adaptation for word-based speech recognition systems
The Journal of the Acoustical Society of America, 1981