XLVIN: eXecuted Latent Value Iteration Nets

10/25/2020
by   Andreea Deac, et al.
2

Value Iteration Networks (VINs) have emerged as a popular method to incorporate planning algorithms within deep reinforcement learning, enabling performance improvements on tasks requiring long-range reasoning and understanding of environment dynamics. This came with several limitations, however: the model is not incentivised in any way to perform meaningful planning computations, the underlying state space is assumed to be discrete, and the Markov decision process (MDP) is assumed fixed and known. We propose eXecuted Latent Value Iteration Networks (XLVINs), which combine recent developments across contrastive self-supervised learning, graph representation learning and neural algorithmic reasoning to alleviate all of the above limitations, successfully deploying VIN-style models on generic environments. XLVINs match the performance of VIN-like models when the underlying MDP is discrete, fixed and known, and provides significant improvements to model-free baselines across three general MDP setups.

READ FULL TEXT
research
10/11/2021

Neural Algorithmic Reasoners are Implicit Planners

Implicit planning has emerged as an elegant technique for combining lear...
research
02/27/2021

CP-MDP: A CANDECOMP-PARAFAC Decomposition Approach to Solve a Markov Decision Process Multidimensional Problem

Markov Decision Process (MDP) is the underlying model for optimal planni...
research
11/29/2022

Continuous Neural Algorithmic Planners

Neural algorithmic reasoning studies the problem of learning algorithms ...
research
04/26/2022

BATS: Best Action Trajectory Stitching

The problem of offline reinforcement learning focuses on learning a good...
research
04/07/2016

Planning with Information-Processing Constraints and Model Uncertainty in Markov Decision Processes

Information-theoretic principles for learning and acting have been propo...
research
09/11/2023

Career Path Recommendations for Long-term Income Maximization: A Reinforcement Learning Approach

This study explores the potential of reinforcement learning algorithms t...
research
04/29/2020

Deep Reinforcement Learning with Graph-based State Representations

Deep RL approaches build much of their success on the ability of the dee...

Please sign up or login with your details

Forgot password? Click here to reset