A Characterization of Multiclass Learnability

03/03/2022
by   Nataly Brukhim, et al.
0

A seminal result in learning theory characterizes the PAC learnability of binary classes through the Vapnik-Chervonenkis dimension. Extending this characterization to the general multiclass setting has been open since the pioneering works on multiclass PAC learning in the late 1980s. This work resolves this problem: we characterize multiclass PAC learnability through the DS dimension, a combinatorial dimension defined by Daniely and Shalev-Shwartz (2014). The classical characterization of the binary case boils down to empirical risk minimization. In contrast, our characterization of the multiclass case involves a variety of algorithmic ideas; these include a natural setting we call list PAC learning. In the list learning setting, instead of predicting a single outcome for a given unseen input, the goal is to provide a short menu of predictions. Our second main result concerns the Natarajan dimension, which has been a central candidate for characterizing multiclass learnability. This dimension was introduced by Natarajan (1988) as a barrier for PAC learning. Whether the Natarajan dimension characterizes PAC learnability in general has been posed as an open question in several papers since. This work provides a negative answer: we construct a non-learnable class with Natarajan dimension one. For the construction, we identify a fundamental connection between concept classes and topology (i.e., colorful simplicial complexes). We crucially rely on a deep and involved construction of hyperbolic pseudo-manifolds by Januszkiewicz and Swiatkowski. It is interesting that hyperbolicity is directly related to learning problems that are difficult to solve although no obvious barriers exist. This is another demonstration of the fruitful links machine learning has with different areas in mathematics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/07/2022

A Characterization of List Learnability

A classical result in learning theory shows the equivalence of PAC learn...
research
08/19/2023

Computing the Vapnik Chervonenkis Dimension for Non-Discrete Settings

In 1984, Valiant [ 7 ] introduced the Probably Approximately Correct (PA...
research
07/18/2021

A Theory of PAC Learnability of Partial Concept Classes

We extend the theory of PAC learning in a way which allows to model a ri...
research
04/18/2023

Impossibility of Characterizing Distribution Learning – a simple solution to a long-standing problem

We consider the long-standing question of finding a parameter of a class...
research
05/27/2011

PAC learnability under non-atomic measures: a problem by Vidyasagar

In response to a 1997 problem of M. Vidyasagar, we state a criterion for...
research
03/20/2019

A Learning Framework for Distribution-Based Game-Theoretic Solution Concepts

The past few years have seen several works establishing PAC frameworks f...
research
03/27/2023

List Online Classification

We study multiclass online prediction where the learner can predict usin...

Please sign up or login with your details

Forgot password? Click here to reset