Parameter-free online learning via model selection

12/30/2017
by Dylan J. Foster, et al.

We introduce an efficient algorithmic framework for model selection in online learning, also known as parameter-free online learning. Departing from previous work, which has focused on highly structured function classes such as nested balls in Hilbert space, we propose a generic meta-algorithm framework that achieves online model selection oracle inequalities under minimal structural assumptions. We give the first computationally efficient parameter-free algorithms that work in arbitrary Banach spaces under mild smoothness assumptions; previous results applied only to Hilbert spaces. We further derive new oracle inequalities for matrix classes, non-nested convex sets, and R^d with generic regularizers. Finally, we generalize these results by providing oracle inequalities for arbitrary non-linear classes in the online supervised learning model. These results are all derived through a unified meta-algorithm scheme using a novel "multi-scale" algorithm for prediction with expert advice based on random playout, which may be of independent interest.
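For readers unfamiliar with the term, an online model selection oracle inequality can be written schematically as below. This is only a sketch of the general shape: the notation (classes $\mathcal{F}_k$, learner predictions $\hat{y}_t$, outcomes $z_t$, loss $\ell$, and penalty $\mathrm{Pen}_n(k)$) is assumed here for illustration and is not taken from the abstract.

```latex
% Schematic form only; all symbols are illustrative assumptions.
% The learner's cumulative loss over n rounds is compared with the best
% trade-off between the loss of the best predictor in a class F_k and a
% penalty Pen_n(k) that grows with the complexity of that class.
\[
  \sum_{t=1}^{n} \ell(\hat{y}_t, z_t)
  \;\le\;
  \min_{k} \Bigl\{ \inf_{f \in \mathcal{F}_k} \sum_{t=1}^{n} \ell(f, z_t)
  + \mathrm{Pen}_n(k) \Bigr\}
\]
```

In this reading, "parameter-free" means the learner does not need to be told in advance which index $k$ (for example, which norm bound on the comparator) to tune for.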


research
08/01/2012

Oracle inequalities for computationally adaptive model selection

We analyze general model selection procedures using penalized empirical ...
research
11/03/2022

Oracle Inequalities for Model Selection in Offline Reinforcement Learning

In offline reinforcement learning (RL), a learner leverages prior logged...
research
04/13/2017

ZigZag: A new approach to adaptive online learning

We develop a novel family of algorithms for the online learning setting ...
research
03/20/2018

Online Learning: Sufficient Statistics and the Burkholder Method

We uncover a fairly general principle in online learning: If regret can ...
research
08/15/2023

Simple online learning with consistency oracle

We consider online learning in the model where a learning algorithm can ...
research
12/15/2017

Oracle inequalities for the stochastic differential equations

This paper is a survey of recent results on the adaptive robust non para...
research
03/15/2017

Online Learning for Distribution-Free Prediction

We develop an online learning method for prediction, which is important ...
