Preference elicitation and inverse reinforcement learning

04/29/2011
by   Constantin Rothkopf, et al.
0

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous work on Bayesian inverse reinforcement learning and allows us to obtain a posterior distribution on the agent's preferences, policy and optionally, the obtained reward sequence, from observations. We examine the relation of the resulting approach to other statistical methods for inverse reinforcement learning via analysis and experimental results. We show that preferences can be determined accurately, even if the observed agent's policy is sub-optimal with respect to its own preferences. In that case, significantly improved policies with respect to the agent's preferences are obtained, compared to both other methods and to the performance of the demonstrated policy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/09/2014

Probabilistic inverse reinforcement learning in unknown environments

We consider the problem of learning by demonstration from agents acting ...
research
06/16/2020

Preference-based Reinforcement Learning with Finite-Time Guarantees

Preference-based Reinforcement Learning (PbRL) replaces reward values in...
research
08/04/2019

Dueling Posterior Sampling for Preference-Based Reinforcement Learning

In preference-based reinforcement learning (RL), an agent interacts with...
research
06/19/2021

Learning the Preferences of Uncertain Humans with Inverse Decision Theory

Existing observational approaches for learning human preferences, such a...
research
12/18/2015

Learning the Preferences of Ignorant, Inconsistent Agents

An important use of machine learning is to learn what people value. What...
research
05/21/2018

A Framework and Method for Online Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) is the problem of learning the pref...
research
10/24/2019

Rationally Inattentive Inverse Reinforcement Learning Explains YouTube Commenting Behavior

We consider a novel application of inverse reinforcement learning which ...

Please sign up or login with your details

Forgot password? Click here to reset