A new speaker adaptation technique using very short calibration speech
- 1 January 1993
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2, 562-565 vol.2
- https://doi.org/10.1109/icassp.1993.319369
Abstract
A speaker adaptation technique based on the separation of speech spectra variation sources is developed for improving speaker-independent continuous speech recognition. The variation sources include speaker acoustic characteristics, phonologic characteristics, and contextual dependency of allophones. Statistical methods are formulated to normalize speech spectra based on speaker acoustic characteristics and then adapt mixture Gaussian density phone models based on speaker phonologic characteristics. Adaptation experiments using short calibration speech (5 s/speaker) have shown substantial performance improvement over the baseline recognition system. On a TIMIT test set, where the task vocabulary size is 853 and the test set perplexity is 104, the recognition word accuracy has been improved from 86.9% to 90.6% (28.2% error reduction). On a separate test set which contains an additional variation source of recording channel mismatch and with the test set perplexity of 101, the recognition word accuracy has been improved from 65.4% to 85.5% (58.1% error reduction).<>Keywords
This publication has 12 references indexed in Scilit:
- Unsupervised speaker adaptation by probabilistic spectrum fittingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Unsupervised speaker adaptation method based on hierarchical spectral clusteringPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- A study on speaker adaptation of continuous density HMM parametersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- A speaker-independent continuous speech recognition system using continuous mixture Gaussian density HMM of phoneme-sized unitsIEEE Transactions on Speech and Audio Processing, 1993
- A Bayesian approach to speaker adaptation for the stochastic segment modelPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- An LVQ based reference model for speaker-adaptive speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1992
- On speaker-independent, speaker-dependent, and speaker-adaptive speech recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Speaker adaptation in continuous speech recognition via estimation of correlated mean vectorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Vowel normalization by frequency warped spectral matchingSpeech Communication, 1986
- Speaker adaptation for word-based speech recognition systemsThe Journal of the Acoustical Society of America, 1981