Improving generalization in backpropagation networks with distributed bottlenecks
- 1 January 1989
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 443-447 vol.1
- https://doi.org/10.1109/ijcnn.1989.118617
Abstract
The primary goal of any adaptive system that learns by example is to generalize from the training examples to novel inputs. The backpropagation learning algorithm is popular for its simplicity and landmark cases of generalization. It has been observed that backpropagation networks sometimes generalize better when they contain a hidden layer that has considerably fewer units than previous layers. The functional properties of such hidden-layer bottlenecks are analyzed, and a method for dynamically creating them, concurrent with backpropagation learning, is described. The method does not excise hidden units; rather, it compresses the dimensionality of the space spanned by the hidden-unit weight vectors and forms clusters of weight vectors in the low-dimensional space. The result is a functional bottleneck distributed across many units. The method is a gradient descent procedure, using local computations on simple lateral Hebbian connections between hidden units.
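The distinction the abstract draws between the number of hidden units and the dimensionality of the space spanned by their weight vectors can be illustrated with a small sketch. This is not the paper's procedure: it only measures the rank of an assumed weight matrix, and the names `matrix_rank`, `basis_1`, `basis_2`, `coefficients`, and `hidden_weights`, along with all numeric values, are hypothetical choices made for illustration.

```python
def matrix_rank(rows, tol=1e-9):
    """Rank of a matrix (list of row lists) via Gaussian elimination with partial pivoting."""
    m = [[float(x) for x in row] for row in rows]
    n_cols = len(m[0]) if m else 0
    rank = 0
    for col in range(n_cols):
        if rank == len(m):
            break
        # Choose the not-yet-used row with the largest entry in this column.
        pivot = max(range(rank, len(m)), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < tol:
            continue  # no usable pivot here: this column adds no new dimension
        m[rank], m[pivot] = m[pivot], m[rank]
        for r in range(rank + 1, len(m)):
            factor = m[r][col] / m[rank][col]
            for c in range(col, n_cols):
                m[r][c] -= factor * m[rank][c]
        rank += 1
    return rank


# Assumption: 4 inputs feed 6 hidden units, and every unit's weight vector
# is a linear combination of the same two (hypothetical) basis vectors.
basis_1 = [1.0, 0.0, 2.0, -1.0]
basis_2 = [0.0, 1.0, -1.0, 3.0]
coefficients = [(1, 0), (0, 1), (1, 1), (2, -1), (-1, 2), (0.5, 0.5)]

# One row per hidden unit: its incoming weight vector.
hidden_weights = [
    [a * x + b * y for x, y in zip(basis_1, basis_2)]
    for a, b in coefficients
]

print("hidden units:", len(hidden_weights))
print("dimension spanned by weight vectors:", matrix_rank(hidden_weights))
```

Under these assumed weights all six units are retained, yet their weight vectors span only a two-dimensional subspace, which is the sense in which a bottleneck can be functional and distributed rather than a matter of unit count.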