Relative entropy and learning rules
- 1 January 1991
- journal article
- research article
- Published by American Physical Society (APS) in Physical Review A
- Vol. 43 (2), 1061-1070
- https://doi.org/10.1103/physreva.43.1061
Abstract
The dynamics of a probabilistic neural network is characterized by the distribution ν(x’‖x) of successor states x’ of an arbitrary state x of the network. A prescribed memory or behavior pattern is represented in terms of an ordered sequence of network states ,,...,. A successful procedure for learning this pattern must modify the neuronal interactions in such a way that the dynamical successor of is likely to be , with =. The relative entropy G of the probability distribution ,x’ concentrated at the desired successor state, evaluated with respect to the dynamical distribution ν(x’‖), is used to quantify this criterion, by providing a measure of the distance between actual and ideal probability distributions. Minimization of G subject to appropriate resource constraints leads to ‘‘optimal’’ learning rules for pairwise and higher-order neuronal interactions. The degree to which optimality is approached by simple learning rules in current use is considered, and it is found, in particular, that the algorithm adopted in the Hopfield model is more effective in minimizing G than the original Hebb law.
Keywords
This publication has 18 references indexed in Scilit:
- Experiments in artificial psychology: conditioning of asynchronous neutral network modelsMathematical Biosciences, 1990
- On learning rules and memory storage abilities of asymmetrical neural networksJournal de Physique, 1988
- Parallel Distributed ProcessingPublished by MIT Press ,1986
- Neural networks and physical systems with emergent collective computational abilities.Proceedings of the National Academy of Sciences, 1982
- Neural AssembliesPublished by Springer Nature ,1982
- Toward a modern theory of adaptive networks: Expectation and prediction.Psychological Review, 1981
- Adaptive pattern classification and universal recoding: I. Parallel development and coding of neural feature detectorsBiological Cybernetics, 1976
- The existence of persistent states in the brainMathematical Biosciences, 1974
- Two models for memory organization using interacting tracesMathematical Biosciences, 1970
- Outline of a theory of thought-processes and thinking machinesJournal of Theoretical Biology, 1961