Avoiding false local minima by proper initialization of connections
- 1 January 1992
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Neural Networks
- Vol. 3 (6), 899-905
- https://doi.org/10.1109/72.165592
Abstract
The training of neural net classifiers is often hampered by the occurrence of local minima, which results in the attainment of inferior classification performance. It has been shown that the occurrence of local minima in the criterion function is often related to specific patterns of defects in the classifier. In particular, three main causes for local minima were identified. Such an understanding of the physical correlates of local minima suggests sensible ways of choosing the weights from which the training process is initiated. A method of initialization is introduced and shown to decrease the possibility of local minima occurring on various test problems.Keywords
This publication has 5 references indexed in Scilit:
- The Physical Correlates of Local MinimaPublished by Springer Nature ,1990
- A comparison between criterion functions for linear classifiers, with an application to neural netsIEEE Transactions on Systems, Man, and Cybernetics, 1989
- Universal approximation using feedforward networks with non-sigmoid hidden layer activation functionsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1989
- A Learning Algorithm for Boltzmann Machines*Cognitive Science, 1985
- Restart procedures for the conjugate gradient methodMathematical Programming, 1977