Information geometry of Boltzmann machines
- 1 March 1992
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks
- Vol. 3 (2), 260-271
- https://doi.org/10.1109/72.125867
Abstract
A Boltzmann machine is a network of stochastic neurons. The set of all the Boltzmann machines with a fixed topology forms a geometric manifold of high dimension, where modifiable synaptic weights of connections play the role of a coordinate system to specify networks. A learning trajectory, for example, is a curve in this manifold. It is important to study the geometry of the neural manifold, rather than the behavior of a single network, in order to know the capabilities and limitations of neural networks of a fixed topology. Using the new theory of information geometry, a natural invariant Riemannian metric and a dual pair of affine connections on the Boltzmann neural network manifold are established. The meaning of geometrical structures is elucidated from the stochastic and the statistical point of view. This leads to a natural modification of the Boltzmann machine learning rule.Keywords
This publication has 20 references indexed in Scilit:
- Four Types of Learning CurvesNeural Computation, 1992
- Statistical mechanics of learning from examplesPhysical Review A, 1992
- Asymptotic Theory of Sequential Estimation: Differential Geometrical ApproachThe Annals of Statistics, 1991
- Mathematical foundations of neurocomputingProceedings of the IEEE, 1990
- The Geometry of Asymptotic InferenceStatistical Science, 1989
- Statistical inference under multiterminal rate restrictions: a differential geometric approachIEEE Transactions on Information Theory, 1989
- Estimation in the Presence of Infinitely many Nuisance Parameters--Geometry of Estimating FunctionsThe Annals of Statistics, 1988
- Differential geometry of a parametric family of invertible linear systems—Riemannian metric, dual affine connections, and divergenceTheory of Computing Systems, 1987
- Geometrical theory of higher-order asymptotics of test, interval estimator and conditional inferenceProceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, 1983
- Differential Geometry of Curved Exponential Families-Curvatures and Information LossThe Annals of Statistics, 1982