Bootstrap Techniques for Error Estimation

1 September 1987

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Pattern Analysis and Machine Intelligence

Vol. PAMI-9 (5), 628-633
https://doi.org/10.1109/tpami.1987.4767957

Abstract

The design of a pattern recognition system requires careful attention to error estimation. The error rate is the most important descriptor of a classifier's performance. The commonly used estimates of error rate are based on the holdout method, the resubstitution method, and the leave-one-out method. All suffer from either large bias or large variance, and their sampling distributions are not known. Bootstrapping refers to a class of procedures that resample given data by computer. It permits determining the statistical properties of an estimator when very little is known about the underlying distribution and no additional samples are available. Since its publication in the last decade, the bootstrap technique has been successfully applied to many statistical estimation and inference problems. However, it has not been exploited in the design of pattern recognition systems. We report results on the application of several bootstrap techniques in estimating the error rate of 1-NN and quadratic classifiers. Our experiments show that, in most cases, the confidence interval of a bootstrap estimator of classification error is smaller than that of the leave-one-out estimator. The error rates of 1-NN, quadratic, and Fisher classifiers are estimated for several real data sets.
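The resampling idea in the abstract can be illustrated with a minimal sketch. It compares the leave-one-out error of a 1-NN classifier with one common bootstrap variant, usually called the E0 estimator: train on a sample drawn with replacement, then test on the points that were not drawn. The abstract does not say which bootstrap variants the authors evaluate, so the choice of E0, the synthetic two-class Gaussian data, and every function name below are assumptions made for illustration only.

```python
import random
import statistics


def nn_predict(train, x):
    """Return the label of the training point nearest to x (squared Euclidean distance)."""
    nearest = min(train, key=lambda p: sum((a - b) ** 2 for a, b in zip(p[0], x)))
    return nearest[1]


def leave_one_out_error(data):
    """Classify each point with 1-NN trained on all the other points."""
    errors = 0
    for i, (x, y) in enumerate(data):
        rest = data[:i] + data[i + 1:]
        errors += nn_predict(rest, x) != y
    return errors / len(data)


def e0_bootstrap_error(data, n_boot=100, seed=0):
    """E0-style bootstrap estimate (an assumed variant, not necessarily the paper's).

    Each replicate trains on n points drawn with replacement and is tested
    on the points left out of that draw. Returns the mean error over the
    replicates and their standard deviation.
    """
    rng = random.Random(seed)
    n = len(data)
    estimates = []
    for _ in range(n_boot):
        idx = [rng.randrange(n) for _ in range(n)]
        chosen = set(idx)
        held_out = [data[i] for i in range(n) if i not in chosen]
        if not held_out:  # every point was drawn; nothing to test on
            continue
        train = [data[i] for i in idx]
        wrong = sum(nn_predict(train, x) != y for x, y in held_out)
        estimates.append(wrong / len(held_out))
    if not estimates:
        raise ValueError("no bootstrap replicate left any point out")
    return statistics.mean(estimates), statistics.pstdev(estimates)


def make_toy_data(n_per_class=30, seed=1):
    """Hypothetical data: two overlapping 2-D Gaussian classes."""
    rng = random.Random(seed)
    data = []
    for label, center in ((0, 0.0), (1, 2.0)):
        for _ in range(n_per_class):
            point = (rng.gauss(center, 1.0), rng.gauss(center, 1.0))
            data.append((point, label))
    return data


if __name__ == "__main__":
    data = make_toy_data()
    mean_e0, spread_e0 = e0_bootstrap_error(data)
    print("leave-one-out error:", leave_one_out_error(data))
    print("E0 bootstrap error: ", mean_e0, "+/-", spread_e0)
```

The spread over replicates gives a rough sense of the estimator's variability without drawing new samples, which is the property the abstract highlights. A single run like this cannot reproduce the paper's comparison of confidence intervals, which would need repeated experiments over many data sets.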


Cited by 138 articles