An Iterative Clustering Procedure

1 July 1971

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Systems, Man, and Cybernetics

Vol. SMC-1 (3), 275-289
https://doi.org/10.1109/tsmc.1971.4308295

Abstract

In many remote sensing applications millions of measurements can be made from a satellite at one time, and many times the data is of marginal value. In these situations clustering techniques might save much data transmission without loss of information since cluster codes may be transmitted instead of multidimensional data points. Data points within a cluster are highly similar so that interpretation of the cluster code can be meaningfully made on the basis of knowing what sort of data point is typical of those in the cluster. We introduce an iterative clustering technique; the procedure suboptimally minimizes the probability of differences between the binary reconstructions from the cluster codes and the original binary data. The iterative clustering technique was programmed for the GE 635 KANDIDATS (Kansas Digital Image Data System) and tested on two data sets. The first was a multi-image set. Twelve images of the northern part of Yellowstone Park were taken by the Michigan scanner system, and the images were reduced and run with the program. Thirty-thousand data points, each consisting of a binary vector of 25 components, were clustered into four clusters. The percentage difference between the components of the reconstructed binary data and the original binary data was 20 percent. The second data set consisted of measurements of the frequency content of the signals from lightning discharges. One hundred and thirty-four data measurements, each consisting of a binary vector of 32 components, were clustered into four clusters.

Keywords

This publication has 44 references indexed in Scilit:

On a class of unsupervised estimation problems
IEEE Transactions on Information Theory, 1968
Nonsupervised sequential classification and recognition of patterns
IEEE Transactions on Information Theory, 1966
An Adaptive Pattern Classification System
IEEE Transactions on Systems Science and Cybernetics, 1966
A Technique for Determining and Coding Subclasses in Pattern Recognition Problems
IBM Journal of Research and Development, 1965
A convergence theorem for linear threshold elements
Bulletin of Mathematical Biology, 1965
A note on the elementary α-perceptron
Bulletin of Mathematical Biology, 1964
A Mathematical Theory of Pattern Recognition
The Annals of Mathematical Statistics, 1963
Hierarchical Linkage Analysis for the Isolation of Types
Educational and Psychological Measurement, 1960
A Quantitative Approach to a Problem in Classification
Evolution, 1957
Analysis of a complex of statistical variables into principal components.
Journal of Educational Psychology, 1933

Cited by 10 articles