Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

04/06/2023
by   Denis Belomestny, et al.
0

In this work, we derive sharp non-asymptotic deviation bounds for weighted sums of Dirichlet random variables. These bounds are based on a novel integral representation of the density of a weighted Dirichlet sum. This representation allows us to obtain a Gaussian-like approximation for the sum distribution using geometry and complex analysis methods. Our results generalize similar bounds for the Beta distribution obtained in the seminal paper Alfers and Dinges [1984]. Additionally, our results can be considered a sharp non-asymptotic version of the inverse of Sanov's theorem studied by Ganesh and O'Connell [1999] in the Bayesian setting. Based on these results, we derive new deviation bounds for the Dirichlet process posterior means with application to Bayesian bootstrap. Finally, we apply our estimates to the analysis of the Multinomial Thompson Sampling (TS) algorithm in multi-armed bandits and significantly sharpen the existing regret bounds by making them independent of the size of the arms distribution support.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2018

On the Non-asymptotic and Sharp Lower Tail Bounds of Random Variables

The non-asymptotic tail bounds of random variables play crucial roles in...
research
10/21/2018

A Non-asymptotic, Sharp, and User-friendly Reverse Chernoff-Cramèr Bound

The Chernoff-Cramèr bound is a widely used technique to analyze the uppe...
research
03/19/2018

What Doubling Tricks Can and Can't Do for Multi-Armed Bandits

An online reinforcement learning algorithm is anytime if it does not nee...
research
07/04/2011

On a Rapid Simulation of the Dirichlet Process

We describe a simple and efficient procedure for approximating the Lévy ...
research
08/19/2021

A Novel Approach to Handling the Non-Central Dirichlet Distribution

In the present paper new insights into the study of the Non-central Diri...
research
05/16/2022

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

We propose the Bayes-UCBVI algorithm for reinforcement learning in tabul...
research
12/02/2013

Consistency of weighted majority votes

We revisit the classical decision-theoretic problem of weighted expert v...

Please sign up or login with your details

Forgot password? Click here to reset