Scale-sensitive dimensions, uniform convergence, and learnability
- 1 July 1997
- journal article
- Published by Association for Computing Machinery (ACM) in Journal of the ACM
- Vol. 44 (4) , 615-631
- https://doi.org/10.1145/263867.263927
Abstract
Learnability in Valiant's PAC learning model has been shown to be strongly related to the existence of uniform laws of large numbers. These laws define a distribution-free convergence property of means to expectations uniformly over classes of random variables. Classes of real-valued functions enjoying such a property are also known as uniform Glivenko-Cantelli classes. In this paper, we prove, through a generalization of Sauer's lemma that may be interesting in its own right, a new characterization of uniform Glivenko-Cantelli classes. Our characterization yields Dudley, Gine´, and Zinn's previous characterization as a corollary. Furthermore, it is the first based on a Gine´, and Zinn's previous characterization as a corollary. Furthermore, it is the first based on a simple combinatorial quantity generalizing the Vapnik-Chervonenkis dimension. We apply this result to obtain the weakest combinatorial condition known to imply PAC learnability in the statistical regression (or “agnostic”) framework. Furthermore, we find a characterization of learnability in the probabilistic concept model, solving an open problem posed by Kearns and Schapire. These results show that the accuracy parameter plays a crucial role in determining the effective complexity of the learner's hypothesis class.Keywords
This publication has 18 references indexed in Scilit:
- A generalization of Sauer's lemmaJournal of Combinatorial Theory, Series A, 1995
- Characterizations of Learnability for Classes of {0, ..., n)-Valued FunctionsJournal of Computer and System Sciences, 1995
- Efficient distribution-free learning of probabilistic conceptsJournal of Computer and System Sciences, 1994
- Uniform and universal Glivenko-Cantelli classesJournal of Theoretical Probability, 1991
- Learnability and the Vapnik-Chervonenkis dimensionJournal of the ACM, 1989
- A lower bound for 0,1,∗ tournament codesDiscrete Mathematics, 1987
- Some Limit Theorems for Empirical ProcessesThe Annals of Probability, 1984
- Modeling by shortest data descriptionAutomatica, 1978
- On the density of families of setsJournal of Combinatorial Theory, Series A, 1972
- A combinatorial problem; stability and order for models and theories in infinitary languagesPacific Journal of Mathematics, 1972