On adaptive decision rules and decision parameter adaptation for automatic speech recognition
Open Access
- 1 August 2000
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings of the IEEE
- Vol. 88 (8) , 1241-1269
- https://doi.org/10.1109/5.880082
Abstract
Recent advances in automatic speech recognition are accomplished by designing a plug-in maximum a posteriori decision rule such that the forms of the acoustic and language model distributions are specified and the parameters of the assumed distributions are estimated from a collection of speech and language training corpora. Maximum-likelihood point estimation is by far the most prevailing training method. However, due to the problems of unknown speech distributions, sparse training data, high spectral and temporal variabilities in speech, and possible mismatch between training and testing conditions, a dynamic training strategy is needed. To cope with the changing speakers and speaking conditions in real operational conditions for high-performance speech recognition, such paradigms incorporate a small amount of speaker and environment specific adaptation data into the training process. Bayesian adaptive learning is an optimal way to combine prior knowledge in an existing collection of general models with a new set of condition-specific adaptation data. In this paper, the mathematical framework for Bayesian adaptation of acoustic and language model parameters is first described. Maximum a posteriori point estimation is then developed for hidden Markov models and a number of useful parameters densities commonly used in automatic speech recognition and natural language processing.Keywords
This publication has 138 references indexed in Scilit:
- Discriminative utterance verification for connected digits recognitionIEEE Transactions on Speech and Audio Processing, 1997
- Bayesian adaptive learning of the parameters of hidden Markov model for speech recognitionIEEE Transactions on Speech and Audio Processing, 1995
- Integrated models of signal and background with application to speaker identification in noiseIEEE Transactions on Speech and Audio Processing, 1994
- Discriminative learning for minimum error classification (pattern recognition)IEEE Transactions on Signal Processing, 1992
- A Quasi-Bayesian Approach to Estimating Parameters for Mixtures of Normal DistributionsJournal of Business & Economic Statistics, 1991
- Tied mixture continuous parameter modeling for speech recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1990
- A tutorial on hidden Markov models and selected applications in speech recognitionProceedings of the IEEE, 1989
- A training procedure for isolated word recognition systemsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
- Sample-based classification procedures related to empiric distributionsIEEE Transactions on Information Theory, 1976
- Sample-Based Classification Procedures Derived from Density EstimatorsJournal of the American Statistical Association, 1972