λ-AC: Learning latent decision-aware models for reinforcement learning in continuous state-spaces

06/30/2023
by Claas A Voelcker, et al.

The idea of decision-aware model learning, that models should be accurate where it matters for decision-making, has gained prominence in model-based reinforcement learning. While promising theoretical results have been established, the empirical performance of algorithms that use a decision-aware loss has lagged behind, especially in continuous control problems. In this paper, we study the components necessary for decision-aware reinforcement learning models and showcase design choices that enable well-performing algorithms. To this end, we provide a theoretical and empirical investigation into prominent algorithmic ideas in the field. We highlight that empirical design decisions established in the MuZero line of work are vital to achieving good performance for related algorithms, and we show how different instantiations of value-aware algorithms behave differently in stochastic environments. Using these insights, we propose the Latent Model-Based Decision-Aware Actor-Critic framework (λ-AC) for decision-aware model-based reinforcement learning in continuous state spaces and highlight important design choices in different environments.
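
As a rough illustration of what "accurate where it matters for decision-making" can mean, the sketch below contrasts a plain squared-error model loss with a value-aware loss that penalizes only prediction errors that change a value estimate. It is a toy example and not the paper's λ-AC algorithm: the linear value function `value_fn`, its weights `VALUE_WEIGHTS`, and the two hand-built model predictions are assumptions made purely for illustration.

```python
import numpy as np

# Hypothetical linear value function (an assumption for this sketch):
# only the first state dimension affects the value.
VALUE_WEIGHTS = np.array([1.0, 0.0])


def value_fn(states):
    """Value estimate for a batch of states of shape (batch, 2)."""
    return states @ VALUE_WEIGHTS


def mse_loss(pred_next, true_next):
    """Standard model loss: squared error over all state dimensions."""
    return float(np.mean(np.sum((pred_next - true_next) ** 2, axis=-1)))


def value_aware_loss(pred_next, true_next):
    """Decision-aware loss: squared error between value estimates only."""
    return float(np.mean((value_fn(pred_next) - value_fn(true_next)) ** 2))


# Toy batch of true next states.
true_next = np.array([[1.0, 2.0], [0.5, -1.0]])

# Model A errs only in the dimension the value function ignores.
pred_a = true_next + np.array([0.0, 0.5])
# Model B makes an error of the same size in the value-relevant dimension.
pred_b = true_next + np.array([0.5, 0.0])

for name, pred in [("A", pred_a), ("B", pred_b)]:
    print(name, mse_loss(pred, true_next), value_aware_loss(pred, true_next))
```

Under these assumptions, both models have the same squared error, but only model B incurs a value-aware loss, since its error is the only one that would change a value-based decision.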


research
11/13/2020

Critic PI2: Master Continuous Planning via Policy Improvement with Path Integrals and Deep Actor-Critic Reinforcement Learning

Constructing agents with planning capabilities has long been one of the ...
research
08/06/2020

Contrastive Variational Model-Based Reinforcement Learning for Complex Observations

Deep model-based reinforcement learning (MBRL) has achieved great sample...
research
06/12/2020

Continuous Control for Searching and Planning with a Learned Model

Decision-making agents with planning capabilities have achieved huge suc...
research
12/16/2021

Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic

Model-based reinforcement learning algorithms, which aim to learn a mode...
research
01/24/2023

Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning

Learning models of the environment from pure interaction is often consid...
research
06/26/2021

Model-Advantage Optimization for Model-Based Reinforcement Learning

Model-based Reinforcement Learning (MBRL) algorithms have been tradition...
research
06/15/2023

Deep Generative Models for Decision-Making and Control

Deep model-based reinforcement learning methods offer a conceptually sim...
