(Locally) Differentially Private Combinatorial Semi-Bandits

06/01/2020
by   Xiaoyu Chen, et al.
15

In this paper, we study Combinatorial Semi-Bandits (CSB) that is an extension of classic Multi-Armed Bandits (MAB) under Differential Privacy (DP) and stronger Local Differential Privacy (LDP) setting. Since the server receives more information from users in CSB, it usually causes additional dependence on the dimension of data, which is a notorious side-effect for privacy preserving learning. However for CSB under two common smoothness assumptions <cit.>, we show it is possible to remove this side-effect. In detail, for B_∞-bounded smooth CSB under either ε-LDP or ε-DP, we prove the optimal regret bound is Θ(mB^2_∞ln T /Δϵ^2) or Θ̃(mB^2_∞ln T/Δϵ) respectively, where T is time period, Δ is the gap of rewards and m is the number of base arms, by proposing novel algorithms and matching lower bounds. For B_1-bounded smooth CSB under ε-DP, we also prove the optimal regret bound is Θ̃(mKB^2_1ln T/Δϵ) with both upper bound and lower bound, where K is the maximum number of feedback in each round. All above results nearly match corresponding non-private optimal rates, which imply there is no additional price for (locally) differentially private CSB in above common settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/06/2022

When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

We study the problem of multi-armed bandits with ϵ-global Differential P...
research
06/04/2021

Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits

In this paper we study the problem of stochastic multi-armed bandits (MA...
research
06/01/2023

Differentially Private Episodic Reinforcement Learning with Heavy-tailed Rewards

In this paper, we study the problem of (finite horizon tabular) Markov d...
research
06/01/2020

Locally Differentially Private (Contextual) Bandits Learning

We study locally differentially private (LDP) bandits learning in this p...
research
04/23/2023

Robust and differentially private stochastic linear bandits

In this paper, we study the stochastic linear bandit problem under the a...
research
07/12/2022

Differentially Private Linear Bandits with Partial Distributed Feedback

In this paper, we study the problem of global reward maximization with o...
research
01/02/2023

Local Differential Privacy for Sequential Decision Making in a Changing Environment

We study the problem of preserving privacy while still providing high ut...

Please sign up or login with your details

Forgot password? Click here to reset