Online Learning: Sufficient Statistics and the Burkholder Method

03/20/2018
by Dylan J. Foster, et al.

We uncover a fairly general principle in online learning: if regret can be (approximately) expressed as a function of certain "sufficient statistics" for the data sequence, then there exists a special Burkholder function that (1) can be used algorithmically to achieve the regret bound and (2) depends only on these sufficient statistics, not on the entire data sequence, so that the online strategy needs to keep only the sufficient statistics in memory. This characterization is achieved by bringing the full power of the Burkholder method (originally developed for certifying probabilistic martingale inequalities) to bear on the online learning setting. To demonstrate the scope and effectiveness of the Burkholder method, we develop a novel online strategy for matrix prediction that attains a regret bound corresponding to the variance term in matrix concentration inequalities. We also present a linear-time/space prediction strategy for parameter-free supervised learning with linear classes and general smooth norms.
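
To make the "sufficient statistics" idea concrete, here is a minimal sketch of an online learner whose predictions depend on the past only through two running quantities. It is not the paper's Burkholder-based strategy: it is a standard adaptive regularized-leader rule for online linear losses over the unit Euclidean ball, swapped in purely for illustration. The class name `SufficientStatLearner`, the helper `run`, and the choice of statistics (gradient sum and sum of squared gradient norms) are assumptions of this sketch.

```python
import math


class SufficientStatLearner:
    """Illustrative learner (hypothetical, not the paper's algorithm).

    Memory is fixed regardless of how many rounds have been played:
    only the gradient sum and the sum of squared gradient norms are kept.
    """

    def __init__(self, dim):
        self.grad_sum = [0.0] * dim   # statistic 1: sum of observed gradients
        self.sq_norm_sum = 0.0        # statistic 2: sum of squared gradient norms

    def predict(self):
        # Before any nonzero gradient is seen, play the origin.
        if self.sq_norm_sum == 0.0:
            return [0.0] * len(self.grad_sum)
        # Adaptive regularized-leader step: -G / sqrt(V) ...
        scale = math.sqrt(self.sq_norm_sum)
        w = [-g / scale for g in self.grad_sum]
        # ... projected back onto the unit Euclidean ball.
        norm = math.sqrt(sum(x * x for x in w))
        if norm > 1.0:
            w = [x / norm for x in w]
        return w

    def update(self, grad):
        # The raw gradient is folded into the statistics and then discarded.
        self.grad_sum = [s + g for s, g in zip(self.grad_sum, grad)]
        self.sq_norm_sum += sum(g * g for g in grad)


def run(learner, grads):
    """Play the learner against a gradient sequence; return cumulative linear loss."""
    total = 0.0
    for g in grads:
        w = learner.predict()
        total += sum(wi * gi for wi, gi in zip(w, g))
        learner.update(g)
    return total
```

Calling `run(SufficientStatLearner(3), grads)` on any list of 3-dimensional gradients returns the cumulative loss, while the learner's state stays at one vector and one scalar throughout.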


research
04/13/2017

ZigZag: A new approach to adaptive online learning

We develop a novel family of algorithms for the online learning setting ...
research
05/29/2019

Matrix-Free Preconditioning in Online Learning

We provide an online convex optimization algorithm with regret that inte...
research
01/21/2020

TopRank+: A Refinement of TopRank Algorithm

Online learning to rank is a core problem in machine learning. In Lattim...
research
05/25/2022

Efficient and Near-Optimal Smoothed Online Learning for Generalized Linear Functions

Due to the drastic gap in complexity between sequential and batch statis...
research
12/30/2017

Parameter-free online learning via model selection

We introduce an efficient algorithmic framework for model selection in o...
research
02/19/2019

Online Learning with Continuous Variations: Dynamic Regret and Reductions

We study the dynamic regret of a new class of online learning problems, ...
research
12/05/2017

Online Learning with Gated Linear Networks

This paper describes a family of probabilistic architectures designed fo...
