Sample-efficient strategies for learning in the presence of noise

1 September 1999

journal article
Published by Association for Computing Machinery (ACM) in Journal of the ACM

Vol. 46 (5), 684-719
https://doi.org/10.1145/324133.324221

Abstract

In this paper, we prove various results about PAC learning in the presence of malicious noise. Our main interest is the sample size behavior of learning algorithms. We prove the first nontrivial sample complexity lower bound in this model by showing that order of ε/Δ² + d/Δ (up to logarithmic factors) examples are necessary for PAC learning any target class of {0,1}-valued functions of VC dimension d, where ε is the desired accuracy and η = ε/(1 + ε) - Δ is the malicious noise rate (it is well known that no nontrivial target class can be PAC learned with accuracy ε and malicious noise rate η ≥ ε/(1 + ε), irrespective of sample complexity). We also show that this result cannot be significantly improved in general by presenting efficient learning algorithms for the class of all subsets of d elements and the class of unions of at most d intervals on the real line. This is especially interesting as we can also show that the popular minimum disagreement strategy needs samples of size dε/Δ², and hence is not optimal with respect to sample size. We then discuss the use of randomized hypotheses. For these, the bound ε/(1 + ε) on the noise rate no longer holds and is replaced by 2ε/(1 + 2ε). In fact, we present a generic algorithm using randomized hypotheses that can tolerate noise rates slightly larger than ε/(1 + ε) while using samples of size d/ε, as in the noise-free case. Again one observes a quadratic power law (in this case dε/Δ², with Δ = 2ε/(1 + 2ε) - η) as Δ goes to zero. We show upper and lower bounds of this order.
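To make the quantities in the abstract concrete, the following is a minimal Python sketch that evaluates the two noise-rate thresholds and the orders of growth quoted above. The function names are hypothetical and chosen only for illustration. The returned values ignore constants and the logarithmic factors the abstract mentions, so they indicate how the bounds scale rather than actual sample sizes.

```python
# Illustrative sketch only: function names are hypothetical, and all results
# are orders of growth (constants and logarithmic factors are omitted).


def deterministic_threshold(eps: float) -> float:
    """Noise rate eps/(1+eps) at or above which PAC learning is impossible."""
    return eps / (1.0 + eps)


def randomized_threshold(eps: float) -> float:
    """Corresponding threshold 2*eps/(1+2*eps) for randomized hypotheses."""
    return 2.0 * eps / (1.0 + 2.0 * eps)


def gap(threshold: float, eta: float) -> float:
    """Delta = threshold - eta; must be positive for learning to be possible."""
    delta = threshold - eta
    if delta <= 0:
        raise ValueError("noise rate eta is at or above the threshold")
    return delta


def lower_bound_order(eps: float, eta: float, d: int) -> float:
    """Order eps/Delta^2 + d/Delta of the sample complexity lower bound."""
    delta = gap(deterministic_threshold(eps), eta)
    return eps / delta**2 + d / delta


def minimum_disagreement_order(eps: float, eta: float, d: int) -> float:
    """Order d*eps/Delta^2 needed by the minimum disagreement strategy."""
    delta = gap(deterministic_threshold(eps), eta)
    return d * eps / delta**2


def randomized_order(eps: float, eta: float, d: int) -> float:
    """Order d*eps/Delta^2 for randomized hypotheses as Delta goes to zero,
    with Delta = 2*eps/(1+2*eps) - eta."""
    delta = gap(randomized_threshold(eps), eta)
    return d * eps / delta**2


if __name__ == "__main__":
    eps, d = 0.1, 10
    eta = 0.05  # below eps/(1+eps), so Delta is positive
    print("deterministic threshold:", deterministic_threshold(eps))
    print("randomized threshold:   ", randomized_threshold(eps))
    print("lower bound order:      ", lower_bound_order(eps, eta, d))
    print("min. disagreement order:", minimum_disagreement_order(eps, eta, d))
```

For fixed ε and Δ, the minimum disagreement order dε/Δ² multiplies the quadratic term by d, whereas the lower bound only adds d/Δ. This gap is what the abstract refers to when it calls the strategy not optimal with respect to sample size.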

