Deterministic annealing for clustering, compression, classification, regression, and related optimization problems
- 1 November 1998
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in Proceedings of the IEEE
- Vol. 86 (11), 2210-2239
- https://doi.org/10.1109/5.726788
Abstract
The deterministic annealing approach to clustering and its extensions has demonstrated substantial performance improvement over standard supervised and unsupervised learning methods in a variety of important applications including compression, estimation, pattern recognition and classification, and statistical regression. The application-specific cost is minimized subject to a constraint on the randomness of the solution, which is gradually lowered. We emphasize the intuition gained from analogy to statistical physics. Alternatively the method is derived within rate-distortion theory, where the annealing process is equivalent to computation of Shannon's rate-distortion function, and the annealing temperature is inversely proportional to the slope of the curve. The basic algorithm is extended by incorporating structural constraints to allow optimization of numerous popular structures including vector quantizers, decision trees, multilayer perceptrons, radial basis functions, and mixtures of experts.Keywords
This publication has 86 references indexed in Scilit:
- Regression modeling in back-propagation and projection pursuit learningIEEE Transactions on Neural Networks, 1994
- On the training of radial basis function classifiersNeural Networks, 1992
- Adaptive Mixtures of Local ExpertsNeural Computation, 1991
- Derivation of a class of training algorithmsIEEE Transactions on Neural Networks, 1990
- Fast Learning in Networks of Locally-Tuned Processing UnitsNeural Computation, 1989
- A construction of vector quantizers for noisy channelsElectronics and Communications in Japan (Part I: Communications), 1984
- Speech coding based upon vector quantizationIEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
- A new look at the statistical model identificationIEEE Transactions on Automatic Control, 1974
- Instabilities of Regression Estimates Relating Air Pollution to MortalityTechnometrics, 1973
- Information Theory and Statistical MechanicsPhysical Review B, 1957