Streaming Algorithms for Learning with Experts: Deterministic Versus Robust

03/03/2023
by   David P. Woodruff, et al.
0

In the online learning with experts problem, an algorithm must make a prediction about an outcome on each of T days (or times), given a set of n experts who make predictions on each day (or time). The algorithm is given feedback on the outcomes of each day, including the cost of its prediction and the cost of the expert predictions, and the goal is to make a prediction with the minimum cost, specifically compared to the best expert in the set. Recent work by Srinivas, Woodruff, Xu, and Zhou (STOC 2022) introduced the study of the online learning with experts problem under memory constraints. However, often the predictions made by experts or algorithms at some time influence future outcomes, so that the input is adaptively chosen. Whereas deterministic algorithms would be robust to adaptive inputs, existing algorithms all crucially use randomization to sample a small number of experts. In this paper, we study deterministic and robust algorithms for the experts problem. We first show a space lower bound of Ω(nM/RT) for any deterministic algorithm that achieves regret R when the best expert makes M mistakes. Our result shows that the natural deterministic algorithm, which iterates through pools of experts until each expert in the pool has erred, is optimal up to polylogarithmic factors. On the positive side, we give a randomized algorithm that is robust to adaptive inputs that uses O(n/R√(T)) space for M=O(R^2 T/log^2 n), thereby showing a smooth space-regret trade-off.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/21/2022

Memory Bounds for the Experts Problem

Online learning with expert advice is a fundamental problem of sequentia...
research
03/02/2023

Improved Space Bounds for Learning with Experts

We give improved tradeoffs between space and regret for the online learn...
research
01/08/2019

Soft-Bayes: Prod for Mixtures of Experts with Log-Loss

We consider prediction with expert advice under the log-loss with the go...
research
03/03/2023

Near Optimal Memory-Regret Tradeoff for Online Learning

In the experts problem, on each of T days, an agent needs to follow the ...
research
02/27/2023

Optimal Prediction Using Expert Advice and Randomized Littlestone Dimension

A classical result in online learning characterizes the optimal mistake ...
research
03/03/2014

Cascading Randomized Weighted Majority: A New Online Ensemble Learning Algorithm

With the increasing volume of data in the world, the best approach for l...
research
05/24/2023

No-Regret Online Prediction with Strategic Experts

We study a generalization of the online binary prediction with expert ad...

Please sign up or login with your details

Forgot password? Click here to reset