Vector quantization based on Gaussian mixture models
- 1 July 2000
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing
- Vol. 8 (4) , 385-401
- https://doi.org/10.1109/89.848220
Abstract
We model the underlying probability density function of vectors in a database as a Gaussian mixture (GM) model. The model is employed for high rate vector quantization analysis and for design of vector quantizers. It is shown that the high rate formulas accurately predict the performance of model-based quantizers. We propose a novel method for optimizing GM model parameters for high rate performance, and an extension to the EM algorithm for densities having bounded support is also presented. The methods are applied to quantization of LPC parameters in speech coding and we present new high rate analysis results for band-limited spectral distortion and outlier statistics. In practical terms, we find that an optimal single-stage VQ can operate at approximately 3 bits less than a state-of-the-art LSF-based 2-split VQ.Keywords
This publication has 24 references indexed in Scilit:
- Spectral quantization of cepstral coefficientsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Quantization, classification, and density estimation for Kohonen's Gaussian mixturePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Efficient encoding of mel-generalized cepstrum for CELP codersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Bennett's integral for vector quantizersIEEE Transactions on Information Theory, 1995
- Theoretical analysis of the high-rate vector quantization of LPC parametersIEEE Transactions on Speech and Audio Processing, 1995
- Robust and efficient quantization of speech LSP parameters using structured vector quantizersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1991
- Asymptotic quantization error of continuous signals and the quantization dimensionIEEE Transactions on Information Theory, 1982
- Asymptotically optimal block quantizationIEEE Transactions on Information Theory, 1979
- An Iterative Procedure for Obtaining Maximum-Likelihood Estimates of the Parameters for a Mixture of Normal DistributionsSIAM Journal on Applied Mathematics, 1978
- Distance measures for speech processingIEEE Transactions on Acoustics, Speech, and Signal Processing, 1976