Decision Stacks: Flexible Reinforcement Learning via Modular Generative Models

06/09/2023
by   Siyan Zhao, et al.
0

Reinforcement learning presents an attractive paradigm to reason about several distinct aspects of sequential decision making, such as specifying complex goals, planning future observations and actions, and critiquing their utilities. However, the combined integration of these capabilities poses competing algorithmic challenges in retaining maximal expressivity while allowing for flexibility in modeling choices for efficient learning and inference. We present Decision Stacks, a generative framework that decomposes goal-conditioned policy agents into 3 generative modules. These modules simulate the temporal evolution of observations, rewards, and actions via independent generative models that can be learned in parallel via teacher forcing. Our framework guarantees both expressivity and flexibility in designing individual modules to account for key factors such as architectural bias, optimization objective and dynamics, transferrability across domains, and inference speed. Our empirical results demonstrate the effectiveness of Decision Stacks for offline policy optimization for several MDP and POMDP environments, outperforming existing methods and enabling flexible generative decision making.

READ FULL TEXT

page 2

page 7

research
07/26/2020

Data-efficient visuomotor policy training using reinforcement learning and generative models

We present a data-efficient framework for solving deep visuomotor sequen...
research
06/15/2023

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually sim...
research
04/18/2022

Training and Evaluation of Deep Policies using Reinforcement Learning and Generative Models

We present a data-efficient framework for solving sequential decision-ma...
research
05/08/2022

Introduction to Soar

This paper is the recommended initial reading for a functional overview ...
research
02/08/2018

Learning and Querying Fast Generative Models for Reinforcement Learning

A key challenge in model-based reinforcement learning (RL) is to synthes...
research
06/18/2019

Inferred successor maps for better transfer learning

Humans and animals show remarkable flexibility in adjusting their behavi...
research
06/01/2023

Augmented Modular Reinforcement Learning based on Heterogeneous Knowledge

In order to mitigate some of the inefficiencies of Reinforcement Learnin...

Please sign up or login with your details

Forgot password? Click here to reset