Balancing Accuracy and Parsimony in Genetic Programming
- 1 March 1995
- journal article
- Published by MIT Press in Evolutionary Computation
- Vol. 3 (1), 17-38
- https://doi.org/10.1162/evco.1995.3.1.17
Abstract
Genetic programming is distinguished from other evolutionary algorithms in that it uses tree representations of variable size instead of linear strings of fixed length. The flexible representation scheme is very important because it allows the underlying structure of the data to be discovered automatically. One primary difficulty, however, is that the solutions may grow too big without any improvement of their generalization ability. In this article we investigate the fundamental relationship between the performance and complexity of the evolved structures. The essence of the parsimony problem is demonstrated empirically by analyzing error landscapes of programs evolved for neural network synthesis. We consider genetic programming as a statistical inference problem and apply the Bayesian model-comparison framework to introduce a class of fitness functions with error and complexity terms. An adaptive learning method is then presented that automatically balances the model-complexity factor to evolve parsimonious programs without losing the diversity of the population needed for achieving the desired training accuracy. The effectiveness of this approach is empirically shown on the induction of sigma-pi neural networks for solving a real-world medical diagnosis problem as well as benchmark tasks.Keywords
This publication has 9 references indexed in Scilit:
- Neural Networks and the Bias/Variance DilemmaNeural Computation, 1992
- An information criterion for optimal neural network selectionIEEE Transactions on Neural Networks, 1991
- Inferring decision trees using the minimum description lenght principleInformation and Computation, 1989
- Product Units: A Computationally Powerful and Biologically Plausible Extension to Backpropagation NetworksNeural Computation, 1989
- Learning, invariance, and generalization in high-order neural networksApplied Optics, 1987
- Occam's RazorInformation Processing Letters, 1987
- Stochastic Complexity and ModelingThe Annals of Statistics, 1986
- Connectionist models and their propertiesCognitive Science, 1982
- Polynomial Theory of Complex SystemsIEEE Transactions on Systems, Man, and Cybernetics, 1971