Non-Asymptotic Sequential Tests for Overlapping Hypotheses and application to near optimal arm identification in bandit models

05/09/2019
by   Aurélien Garivier, et al.
0

In this paper, we study sequential testing problems with overlapping hypotheses. We first focus on the simple problem of assessing if the mean μ of a Gaussian distribution is ≥ -ϵ or ≤ϵ; if μ∈(-ϵ,ϵ), both answers are considered to be correct. Then, we consider PAC-best arm identification in a bandit model: given K probability distributions on R with means μ_1,...,μ_K, we derive the asymptotic complexity of identifying, with risk at most δ, an index I∈{1,...,K} such that μ_I≥_iμ_i -ϵ. We provide non asymptotic bounds on the error of a parallel General Likelihood Ratio Test, which can also be used for more general testing problems. We further propose lower bound on the number of observation needed to identify a correct hypothesis. Those lower bounds rely on information-theoretic arguments, and specifically on two versions of a change of measure lemma (a high-level form, and a low-level form) whose relative merits are discussed.

READ FULL TEXT
research
08/17/2020

Optimal Best-Arm Identification Methods for Tail-Risk Measures

Conditional value-at-risk (CVaR) and value-at-risk (VaR) are popular tai...
research
09/30/2022

On Best-Arm Identification with a Fixed Budget in Non-Parametric Multi-Armed Bandits

We lay the foundations of a non-parametric theory of best-arm identifica...
research
12/02/2019

Optimal Best Markovian Arm Identification with Fixed Confidence

We give a complete characterization of the sampling complexity of best M...
research
02/24/2023

On Stein's lemma in hypotheses testing in general non-asymptotic case

The problem of testing two simple hypotheses in a general probability sp...
research
07/22/2021

Bandit Quickest Changepoint Detection

Detecting abrupt changes in temporal behavior patterns is of interest in...
research
08/24/2019

Optimal best arm selection for general distributions

Given a finite set of unknown distributions or arms that can be sampled ...
research
12/15/2020

Generalized Chernoff Sampling for Active Learning and Structured Bandit Algorithms

Active learning and structured stochastic bandit problems are intimately...

Please sign up or login with your details

Forgot password? Click here to reset