Distortion measures for speech processing
- 1 August 1980
- journal article
- research article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Acoustics, Speech, and Signal Processing
- Vol. 28 (4) , 367-376
- https://doi.org/10.1109/tassp.1980.1163421
Abstract
Several properties, interrelations, and interpretations are developed for various speech spectral distortion measures. The principle results are 1) the development of notions of relative strength and equivalence of the various distortion measures both in a mathematical sense corresponding to subjective equivalence and in a coding sense when used in minimum distortion or nearest neighbor speech processing systems; 2) the demonstration that the Itakura-Saito and related distortion measures possess a property similar to the triangle inequality when used in nearest neighbor systems such as quantization and cluster analysis; and 3) that the Itakura-Saito and normalized model distortion measures yield efficient computation algorithms for generalized centroids or minimum distortion points of groups or clusters of speech frames, an important computation in both classical cluster analysis techniques and in algorithms for optimal quantizer design. We also argue that the Itakura-Saito and related distortions are well-suited computationally, mathematically, and intuitively for such applications.Keywords
This publication has 17 references indexed in Scilit:
- Speech coding based upon vector quantizationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A two-step speech compression system with vector quantizingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Rate-distortion speech coding with a minimum discrimination information distortion measureIEEE Transactions on Information Theory, 1981
- Locally optimal block quantizer designInformation and Control, 1980
- Automatic Classification of Electroencephalograms: Kullback-Leibler Nearest Neighbor RulesScience, 1979
- Interactive clustering techniques for selecting speaker-independent reference templates for isolated word recognitionIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Statistical tests and distance measures for LPC coefficientsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1977
- Quantization properties of transmission parameters in linear predictive systemsIEEE Transactions on Acoustics, Speech, and Signal Processing, 1975
- $I$-Divergence Geometry of Probability Distributions and Minimization ProblemsThe Annals of Probability, 1975
- The Divergence and Bhattacharyya Distance Measures in Signal SelectionIEEE Transactions on Communications, 1967