Online Fair Revenue Maximizing Cake Division with Non-Contiguous Pieces in Adversarial Bandits

11/29/2021
by   Mohammad Ghodsi, et al.
0

The classic cake-cutting problem provides a model for addressing the fair and efficient allocation of a divisible, heterogeneous resource among agents with distinct preferences. Focusing on a standard formulation of cake cutting, in which each agent must receive a contiguous piece of the cake in an offline setting, this work instead focuses on online allocating non-contiguous pieces of cake among agents and establishes algorithmic results for fairness measures. In this regard, we made use of classic adversarial multi-armed bandits to achieve sub-linear Fairness and Revenue Regret at the same time. Adversarial bandits are powerful tools to model the adversarial reinforcement learning environments, that provide strong upper-bounds for regret of learning with just observing one action's reward in each step by applying smart trade-off between exploration and exploitation. This work studies the power of the famous EXP_3 algorithm that is based on exponential wight-importance updating probability distribution through time horizon.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/25/2019

Fair and Efficient Cake Division with Connected Pieces

The classic cake-cutting problem provides a model for addressing fair an...
research
03/19/2018

What Doubling Tricks Can and Can't Do for Multi-Armed Bandits

An online reinforcement learning algorithm is anytime if it does not nee...
research
10/27/2021

Fairer LP-based Online Allocation

In this paper, we consider a Linear Program (LP)-based online resource a...
research
05/23/2016

Fairness in Learning: Classic and Contextual Bandits

We introduce the study of fairness in multi-armed bandit problems. Our f...
research
05/31/2020

Fair Cake Division Under Monotone Likelihood Ratios

This work develops algorithmic results for the classic cake-cutting prob...
research
11/28/2018

Adversarial Bandits with Knapsacks

We consider Bandits with Knapsacks (henceforth, BwK), a general model fo...

Please sign up or login with your details

Forgot password? Click here to reset