Multiclass Online Learnability under Bandit Feedback

08/08/2023
by   Ananth Raman, et al.
0

We study online multiclass classification under bandit feedback. We extend the results of (daniely2013price) by showing that the finiteness of the Bandit Littlestone dimension is necessary and sufficient for bandit online multiclass learnability even when the label space is unbounded. Our result complements the recent work by (hanneke2023multiclass) who show that the Littlestone dimension characterizes online multiclass learnability in the full-information setting when the label space is unbounded.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset