Relative entropy and learning rules

Abstract
The dynamics of a probabilistic neural network is characterized by the distribution ν(x'|x) of successor states x' of an arbitrary state x of the network. A prescribed memory or behavior pattern is represented in terms of an ordered sequence of network states x(1), x(2), ..., x(l). A successful procedure for learning this pattern must modify the neuronal interactions in such a way that the dynamical successor of x(s) is likely to be x(s+1), with x(l+1) = x(1). The relative entropy G of the probability distribution δ_{x(s+1),x'} concentrated at the desired successor state, evaluated with respect to the dynamical distribution ν(x'|x(s)), is used to quantify this criterion by providing a measure of the distance between the actual and ideal probability distributions. Minimization of G subject to appropriate resource constraints leads to "optimal" learning rules for pairwise and higher-order neuronal interactions. The degree to which optimality is approached by simple learning rules in current use is considered; it is found, in particular, that the algorithm adopted in the Hopfield model is more effective in minimizing G than the original Hebb law.
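
For concreteness, the distance criterion stated in the abstract can be written out. Because δ_{x(s+1),x'} puts all its weight on the single desired successor, its relative entropy with respect to ν(x'|x(s)) collapses to one term; summing over the steps of the pattern is one natural way to score the full sequence (the normalization and any weighting over s are not specified in the abstract and are assumptions here):

    G_s = \sum_{x'} \delta_{x(s+1),x'} \,
          \ln \frac{\delta_{x(s+1),x'}}{\nu(x'\,|\,x(s))}
        = -\ln \nu\bigl(x(s+1)\,|\,x(s)\bigr),
    \qquad
    G = \sum_{s=1}^{l} G_s .

In this form, minimizing G amounts to maximizing the likelihood that the network steps through the prescribed cycle, which is why its gradient with respect to the interaction strengths yields a learning rule.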