Abstract
The primary goal of any adaptive system that learns by example is to generalize from the training examples to novel inputs. The backpropagation learning algorithm is popular for its simplicity and for landmark demonstrations of generalization. It has been observed that backpropagation networks sometimes generalize better when they contain a hidden layer with considerably fewer units than the preceding layers. The functional properties of such hidden-layer bottlenecks are analyzed, and a method for dynamically creating them, concurrent with backpropagation learning, is described. The method does not excise hidden units; rather, it compresses the dimensionality of the space spanned by the hidden-unit weight vectors and forms clusters of weight vectors in the low-dimensional space. The result is a functional bottleneck distributed across many units. The method is a gradient descent procedure, using local computations on simple lateral Hebbian connections between hidden units.
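To make the idea concrete, here is a minimal, illustrative sketch in Python rather than the paper's exact procedure: after each ordinary backpropagation step, every hidden unit's incoming weight vector is nudged toward the weight vectors of similar units, so the vectors drift into clusters and collectively span a lower-dimensional subspace. The function names (`bottleneck_step`), the use of cosine similarity as a stand-in for lateral Hebbian connection strengths, and the specific penalty being minimized are all assumptions made for this example, not the published method.

```python
import numpy as np

def bottleneck_step(W, rate=0.05):
    """Pull hidden-unit weight vectors toward similar units (hypothetical sketch).

    W    : (n_hidden, n_inputs) matrix of incoming weight vectors.
    rate : step size for the lateral clustering update.
    """
    # Pairwise cosine similarity between weight vectors stands in for the
    # lateral Hebbian connection strengths; only positive similarities attract.
    norms = np.linalg.norm(W, axis=1, keepdims=True) + 1e-12
    U = W / norms
    S = np.clip(U @ U.T, 0.0, None)   # (n_hidden, n_hidden) attraction weights
    np.fill_diagonal(S, 0.0)

    # One gradient-descent step on 0.5 * sum_ij S_ij * ||w_i - w_j||^2,
    # treating S as constant: each unit moves toward units it resembles.
    attraction = S @ W - S.sum(axis=1, keepdims=True) * W
    return W + rate * attraction

# Usage: interleave the lateral update with ordinary backpropagation updates.
rng = np.random.default_rng(0)
W_hidden = rng.normal(size=(8, 20))   # 8 hidden units, 20 inputs
for _ in range(100):
    # ... ordinary backprop update of W_hidden would go here ...
    W_hidden = bottleneck_step(W_hidden)

# The singular-value spectrum of W_hidden contracts toward a few dominant
# directions, i.e. a distributed bottleneck rather than removed units.
print("singular values:", np.round(np.linalg.svd(W_hidden, compute_uv=False), 3))
```

Note that no unit is deleted in this sketch; the compression shows up only in the effective rank of the weight matrix, mirroring the distributed bottleneck described above.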
