Balancing Accuracy and Parsimony in Genetic Programming

1 March 1995

journal article
Published by MIT Press in Evolutionary Computation

Vol. 3 (1), 17-38
https://doi.org/10.1162/evco.1995.3.1.17

Abstract

Genetic programming is distinguished from other evolutionary algorithms in that it uses tree representations of variable size instead of linear strings of fixed length. The flexible representation scheme is very important because it allows the underlying structure of the data to be discovered automatically. One primary difficulty, however, is that the solutions may grow too big without any improvement of their generalization ability. In this article we investigate the fundamental relationship between the performance and complexity of the evolved structures. The essence of the parsimony problem is demonstrated empirically by analyzing error landscapes of programs evolved for neural network synthesis. We consider genetic programming as a statistical inference problem and apply the Bayesian model-comparison framework to introduce a class of fitness functions with error and complexity terms. An adaptive learning method is then presented that automatically balances the model-complexity factor to evolve parsimonious programs without losing the diversity of the population needed for achieving the desired training accuracy. The effectiveness of this approach is empirically shown on the induction of sigma-pi neural networks for solving a real-world medical diagnosis problem as well as benchmark tasks.

Keywords

This publication has 9 references indexed in Scilit:

Neural Networks and the Bias/Variance Dilemma
Neural Computation, 1992
An information criterion for optimal neural network selection
IEEE Transactions on Neural Networks, 1991
Inferring decision trees using the minimum description lenght principle
Information and Computation, 1989
Product Units: A Computationally Powerful and Biologically Plausible Extension to Backpropagation Networks
Neural Computation, 1989
Learning, invariance, and generalization in high-order neural networks
Applied Optics, 1987
Occam's Razor
Information Processing Letters, 1987
Stochastic Complexity and Modeling
The Annals of Statistics, 1986
Connectionist models and their properties
Cognitive Science, 1982
Polynomial Theory of Complex Systems
IEEE Transactions on Systems, Man, and Cybernetics, 1971

Cited by 149 articles