Minimum Cross-Entropy Pattern Classification and Cluster Analysis

1 January 1982

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. PAMI-4 (1), 11-17
https://doi.org/10.1109/tpami.1982.4767189

Abstract

This paper considers the problem of classifying an input vector of measurements by a nearest neighbor rule applied to a fixed set of vectors. The fixed vectors are sometimes called characteristic feature vectors, codewords, cluster centers, models, reproductions, etc. The nearest neighbor rule considered uses a non-Euclidean information-theoretic distortion measure that is not a metric, but that nevertheless leads to a classification method that is optimal in a well-defined sense and is also computationally attractive. Furthermore, the distortion measure results in a simple method of computing cluster centroids. Our approach is based on the minimization of cross-entropy (also called discrimination information, directed divergence, K-L number), and can be viewed as a refinement of a general classification method due to Kullback. The refinement exploits special properties of cross-entropy that hold when the probability densities involved happen to be minimum cross-entropy densities. The approach is a generalization of a recently developed speech coding technique called speech coding by vector quantization.
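
As an illustration only, the nearest neighbor rule described in the abstract can be sketched for discrete probability vectors. The sketch below is not the paper's method: it assumes plain discrete distributions rather than minimum cross-entropy densities, and the function names (`cross_entropy`, `classify`, `centroid`) are hypothetical. The centroid shown is the arithmetic mean, which is the known minimizer of the summed divergence when the centroid sits in the second argument; the paper's own centroid computation may differ.

```python
import math


def cross_entropy(q, p):
    """Directed divergence D(q || p) = sum_i q_i * log(q_i / p_i), in nats."""
    total = 0.0
    for qi, pi in zip(q, p):
        if qi == 0.0:
            continue  # terms with q_i = 0 contribute nothing (0 * log 0 := 0)
        if pi == 0.0:
            return math.inf  # q puts mass where p has none
        total += qi * math.log(qi / pi)
    return total


def classify(q, centers):
    """Return the index of the center with the smallest D(q || center)."""
    return min(range(len(centers)), key=lambda k: cross_entropy(q, centers[k]))


def centroid(members):
    """Arithmetic mean of the members (assumption: the centroid is the
    second argument, i.e. it minimizes sum_i D(m_i || c) over c)."""
    n = len(members)
    return [sum(column) / n for column in zip(*members)]


if __name__ == "__main__":
    centers = [[0.7, 0.2, 0.1], [0.1, 0.2, 0.7]]  # hypothetical cluster centers
    print(classify([0.6, 0.3, 0.1], centers))
    print(centroid([[0.2, 0.8], [0.4, 0.6]]))
```

The measure is asymmetric, so `cross_entropy(a, b)` and `cross_entropy(b, a)` generally differ. This is one reason it is not a metric, as the abstract notes.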

This publication has 16 references indexed in Scilit:

Rate-distortion speech coding with a minimum discrimination information distortion measure
IEEE Transactions on Information Theory, 1981
Properties of cross-entropy minimization
IEEE Transactions on Information Theory, 1981
Speech coding based upon vector quantization
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy
IEEE Transactions on Information Theory, 1980
Automatic Classification of Electroencephalograms: Kullback-Leibler Nearest Neighbor Rules
Science, 1979
Determining probability distributions by maximum entropy and minimum cross-entropy
Published by Association for Computing Machinery (ACM), 1979
I-Divergence Geometry of Probability Distributions and Minimization Problems
The Annals of Probability, 1975
Maximum Entropy for Hypothesis Formulation, Especially for Multidimensional Contingency Tables
The Annals of Mathematical Statistics, 1963
Information Theory and Statistical Mechanics. II
Physical Review, 1957
On Quantum Measurements and the Role of the Uncertainty Relations in Statistical Mechanics
Physical Review, 1937