Zero-Shot Assistance in Novel Decision Problems

02/15/2022
by   Sebastiaan De Peuter, et al.
0

We consider the problem of creating assistants that can help agents - often humans - solve novel sequential decision problems, assuming the agent is not able to specify the reward function explicitly to the assistant. Instead of aiming to automate, and act in place of the agent as in current approaches, we give the assistant an advisory role and keep the agent in the loop as the main decision maker. The difficulty is that we must account for potential biases induced by limitations or constraints of the agent which may cause it to seemingly irrationally reject advice. To do this we introduce a novel formalization of assistance that models these biases, allowing the assistant to infer and adapt to them. We then introduce a new method for planning the assistant's advice which can scale to large decision making problems. Finally, we show experimentally that our approach adapts to these agent biases, and results in higher cumulative reward for the agent than automation-based alternatives.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2023

Learning Zero-Shot Cooperation with Humans, Assuming Humans Are Biased

There is a recent trend of applying multi-agent reinforcement learning (...
research
06/01/2023

Extracting Reward Functions from Diffusion Models

Diffusion models have achieved remarkable results in image generation, a...
research
06/05/2022

Sequential Counterfactual Decision-Making Under Confounded Reward

We investigate the limitations of random trials when the cause of intere...
research
06/23/2019

On the Feasibility of Learning, Rather than Assuming, Human Biases for Reward Inference

Our goal is for agents to optimize the right reward function, despite ho...
research
06/04/2017

Planning with Multiple Biases

Recent work has considered theoretical models for the behavior of agents...
research
03/12/2023

Decision Making for Human-in-the-loop Robotic Agents via Uncertainty-Aware Reinforcement Learning

In a Human-in-the-Loop paradigm, a robotic agent is able to act mostly a...

Please sign up or login with your details

Forgot password? Click here to reset