An Iterative Clustering Procedure
- 1 July 1971
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics
- Vol. SMC-1 (3), 275-289
- https://doi.org/10.1109/tsmc.1971.4308295
Abstract
In many remote sensing applications, millions of measurements can be made from a satellite at one time, and many times the data is of marginal value. In these situations clustering techniques might save much data transmission without loss of information, since cluster codes may be transmitted instead of multidimensional data points. Data points within a cluster are highly similar, so that interpretation of the cluster code can be meaningfully made on the basis of knowing what sort of data point is typical of those in the cluster. We introduce an iterative clustering technique; the procedure suboptimally minimizes the probability of differences between the binary reconstructions from the cluster codes and the original binary data. The iterative clustering technique was programmed for the GE 635 KANDIDATS (Kansas Digital Image Data System) and tested on two data sets. The first was a multi-image set. Twelve images of the northern part of Yellowstone Park were taken by the Michigan scanner system, and the images were reduced and run with the program. Thirty thousand data points, each consisting of a binary vector of 25 components, were clustered into four clusters. The percentage difference between the components of the reconstructed binary data and the original binary data was 20 percent. The second data set consisted of measurements of the frequency content of the signals from lightning discharges. One hundred and thirty-four data measurements, each consisting of a binary vector of 32 components, were clustered into four clusters.
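The idea described in the abstract (replace each binary data vector by a cluster code, then reconstruct it from a vector typical of that cluster) can be illustrated with a minimal Python sketch. This is not the procedure from the paper, whose details are not given in this record. It swaps in a simple alternating scheme: assign each vector to the nearest prototype by Hamming distance, then reset each prototype by bitwise majority vote. All function names, the tie rule, the random initialization, and the synthetic data are assumptions made for illustration only.

```python
import random


def hamming(a, b):
    """Number of components in which two equal-length binary vectors differ."""
    return sum(x != y for x, y in zip(a, b))


def majority_prototype(members, n_bits):
    """Bitwise majority vote over the members; ties go to 0 (an assumption)."""
    return tuple(
        1 if 2 * sum(v[j] for v in members) > len(members) else 0
        for j in range(n_bits)
    )


def cluster_binary(data, k, max_iter=50, seed=0):
    """Illustrative alternating clustering of binary vectors (hypothetical helper).

    Returns (codes, prototypes): codes[i] is the cluster code of data[i],
    prototypes[c] is the binary vector used to reconstruct cluster c.
    """
    rng = random.Random(seed)
    n_bits = len(data[0])
    prototypes = rng.sample(data, k)  # assumed initialization: k random data points
    codes = None
    for _ in range(max_iter):
        # Assignment step: nearest prototype in Hamming distance.
        new_codes = [
            min(range(k), key=lambda c: hamming(v, prototypes[c])) for v in data
        ]
        if new_codes == codes:
            break  # assignments stopped changing
        codes = new_codes
        # Update step: majority vote per cluster; empty clusters keep their prototype.
        for c in range(k):
            members = [v for v, code in zip(data, codes) if code == c]
            if members:
                prototypes[c] = majority_prototype(members, n_bits)
    return codes, prototypes


def mismatch_rate(data, codes, prototypes):
    """Fraction of components that differ between data and its reconstruction."""
    total = sum(len(v) for v in data)
    wrong = sum(hamming(v, prototypes[c]) for v, c in zip(data, codes))
    return wrong / total


def bits_per_code(k):
    """Bits needed for a fixed-length code that distinguishes k clusters."""
    return (k - 1).bit_length()


if __name__ == "__main__":
    # Synthetic stand-in data: noisy copies of four random 25-bit patterns.
    rng = random.Random(1)
    centers = [tuple(rng.randint(0, 1) for _ in range(25)) for _ in range(4)]
    data = [
        tuple(b ^ (rng.random() < 0.1) for b in centers[i % 4]) for i in range(200)
    ]
    codes, prototypes = cluster_binary(data, k=4)
    print("bits per point sent:", bits_per_code(4), "instead of", len(data[0]))
    print("component mismatch rate:", mismatch_rate(data, codes, prototypes))
```

Under the assumption of a fixed-length code, four clusters need 2 bits per data point, compared with 25 bits for the raw vector in the first data set. The mismatch rate computed here plays the role of the percentage difference between reconstructed and original binary data that the abstract reports.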