A Reduction-based Framework for Sequential Decision Making with Delayed Feedback

02/03/2023
by   Yunchang Yang, et al.
0

We study stochastic delayed feedback in general multi-agent sequential decision making, which includes bandits, single-agent Markov decision processes (MDPs), and Markov games (MGs). We propose a novel reduction-based framework, which turns any multi-batched algorithm for sequential decision making with instantaneous feedback into a sample-efficient algorithm that can handle stochastic delays in sequential decision making. By plugging different multi-batched algorithms into our framework, we provide several examples demonstrating that our framework not only matches or improves existing results for bandits, tabular MDPs, and tabular MGs, but also provides the first line of studies on delays in sequential decision making with function approximation. In summary, we provide a complete set of sharp results for multi-agent sequential decision making with delayed feedback.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/26/2021

On Blame Attribution for Accountable Multi-Agent Sequential Decision Making

Blame attribution is one of the key aspects of accountable decision maki...
research
07/01/2021

Markov Decision Process modeled with Bandits for Sequential Decision Making in Linear-flow

In membership/subscriber acquisition and retention, we sometimes need to...
research
03/06/2013

Deliberation Scheduling for Time-Critical Sequential Decision Making

We describe a method for time-critical decision making involving sequent...
research
07/01/2023

Provably Efficient UCB-type Algorithms For Learning Predictive State Representations

The general sequential decision-making problem, which includes Markov de...
research
03/23/2020

Anticipatory Psychological Models for Quickest Change Detection: Human Sensor Interaction

We consider anticipatory psychological models for human decision makers ...
research
06/27/2022

Utility Theory for Sequential Decision Making

The von Neumann-Morgenstern (VNM) utility theorem shows that under certa...
research
08/01/2011

Exploiting Agent and Type Independence in Collaborative Graphical Bayesian Games

Efficient collaborative decision making is an important challenge for mu...

Please sign up or login with your details

Forgot password? Click here to reset