Expected Worst Case Regret via Stochastic Sequential Covering

09/09/2022
โˆ™
by   Changlong Wu, et al.
โˆ™
2
โˆ™

We study the problem of sequential prediction and online minimax regret with stochastically generated features under a general loss function. We introduce a notion of expected worst case minimax regret that generalizes and encompasses prior known minimax regrets. For such minimax regrets we establish tight upper bounds via a novel concept of stochastic global sequential covering. We show that for a hypothesis class of VC-dimension ๐–ต๐–ข and i.i.d. generated features of length T, the cardinality of the stochastic global sequential covering can be upper bounded with high probability (whp) by e^O(๐–ต๐–ขยทlog^2 T). We then improve this bound by introducing a new complexity measure called the Star-Littlestone dimension, and show that classes with Star-Littlestone dimension ๐–ฒ๐–ซ admit a stochastic global sequential covering of order e^O(๐–ฒ๐–ซยทlog T). We further establish upper bounds for real valued classes with finite fat-shattering numbers. Finally, by applying information-theoretic tools of the fixed design minimax regrets, we provide lower bounds for the expected worst case minimax regret. We demonstrate the effectiveness of our approach by establishing tight bounds on the expected worst case minimax regrets for logarithmic loss and general mixable losses.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
โˆ™ 02/13/2021

Sequential prediction under log-loss with side information

The problem of online prediction with sequential side information under ...
research
โˆ™ 02/02/2021

Majorizing Measures, Sequential Complexities, and Online Learning

We introduce the technique of generic chaining and majorizing measures f...
research
โˆ™ 05/07/2022

Precise Regret Bounds for Log-loss via a Truncated Bayesian Algorithm

We study the sequential general online regression, known also as the seq...
research
โˆ™ 01/31/2023

Online Learning in Dynamically Changing Environments

We study the problem of online learning and online regret minimization w...
research
โˆ™ 12/10/2014

Generalised Entropy MDPs and Minimax Regret

Bayesian methods suffer from the problem of how to specify prior beliefs...
research
โˆ™ 03/13/2023

Universal coding, intrinsic volumes, and metric complexity

We study sequential probability assignment in the Gaussian setting, wher...
research
โˆ™ 03/05/2023

On the Capacity Limits of Privileged ERM

We study the supervised learning paradigm called Learning Using Privileg...

Please sign up or login with your details

Forgot password? Click here to reset