Correlation Priors for Reinforcement Learning

09/11/2019
by Bastian Alt, et al.
Many decision-making problems naturally exhibit pronounced structures inherited from the characteristics of the underlying environment. In a Markov decision process model, for example, two distinct states can have inherently related semantics or encode similar physical state configurations, which often implies locally correlated transition dynamics among the states. To complete a certain task, an agent acting in such environments needs to execute a series of temporally and spatially correlated actions. Though a variety of approaches account for correlations in continuous state-action domains, a principled solution for discrete environments is missing. In this work, we present a Bayesian learning framework based on Pólya-Gamma augmentation that enables analogous reasoning in such cases. We demonstrate the framework on a number of common decision-making tasks, such as reinforcement learning, imitation learning, and system identification. By explicitly modeling the underlying correlation structures, the proposed approach yields superior predictive performance compared to correlation-agnostic models, even when trained on data sets that are up to an order of magnitude smaller.
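
To make the idea of a correlation prior over discrete states concrete, the following is a minimal sketch and not the authors' implementation. It draws latent Gaussian variables that are correlated across neighboring states and maps them to categorical transition probabilities through a logistic stick-breaking link, a link commonly paired with Pólya-Gamma augmentation. The kernel choice, its parameters, the function names, and the assumption that states are ordered on a line are all illustrative, and the Pólya-Gamma inference step itself is omitted.

```python
import numpy as np


def squared_exp_kernel(n_states, length_scale=2.0, variance=4.0):
    """Covariance between states indexed 0..n_states-1 (assumed to lie on a line)."""
    idx = np.arange(n_states)
    dist = idx[:, None] - idx[None, :]
    return variance * np.exp(-0.5 * (dist / length_scale) ** 2)


def stick_breaking(psi):
    """Map K-1 real values (last axis) to a K-dimensional probability vector."""
    sig = 1.0 / (1.0 + np.exp(-psi))
    # remaining[..., k] is the stick length left before breaking off piece k
    ones = np.ones(psi.shape[:-1] + (1,))
    remaining = np.concatenate([ones, np.cumprod(1.0 - sig, axis=-1)], axis=-1)
    probs = remaining.copy()
    probs[..., :-1] *= sig
    return probs


def sample_correlated_rows(n_states, n_outcomes, rng, jitter=1e-6):
    """Draw one categorical distribution per state, correlated across nearby states."""
    cov = squared_exp_kernel(n_states) + jitter * np.eye(n_states)
    chol = np.linalg.cholesky(cov)
    # One independent correlated draw over states per stick-breaking coordinate
    psi = chol @ rng.standard_normal((n_states, n_outcomes - 1))
    return stick_breaking(psi)


rng = np.random.default_rng(0)
rows = sample_correlated_rows(n_states=10, n_outcomes=4, rng=rng)
print(rows.round(3))
```

Each row of `rows` is a probability vector for one state. Because the latent variables share a smooth covariance, adjacent rows tend to resemble each other more than distant ones, which is the kind of structure a correlation-agnostic prior (for example, independent Dirichlet priors per state) cannot express.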

research
08/03/2020

Tracking the Race Between Deep Reinforcement Learning and Imitation Learning – Extended Version

Learning-based approaches for solving large sequential decision making p...
research
03/09/2020

Learning discrete state abstractions with deep variational inference

Abstraction is crucial for effective sequential decision making in domai...
research
03/29/2021

Robust Reinforcement Learning under model misspecification

Reinforcement learning has achieved remarkable performance in a wide ran...
research
10/05/2020

Learning to Generalize for Sequential Decision Making

We consider problems of making sequences of decisions to accomplish task...
research
06/27/2023

Learning non-Markovian Decision-Making from State-only Sequences

Conventional imitation learning assumes access to the actions of demonst...
research
03/28/2022

REPTILE: A Proactive Real-Time Deep Reinforcement Learning Self-adaptive Framework

In this work a general framework is proposed to support the development ...
research
11/12/2017

Quickest Detection of Markov Networks

Detecting correlation structures in large networks arises in many domain...
