Learning of Generalizable and Interpretable Knowledge in Grid-Based Reinforcement Learning Environments

09/07/2023
by   Manuel Eberhardinger, et al.
0

Understanding the interactions of agents trained with deep reinforcement learning is crucial for deploying agents in games or the real world. In the former, unreasonable actions confuse players. In the latter, that effect is even more significant, as unexpected behavior cause accidents with potentially grave and long-lasting consequences for the involved individuals. In this work, we propose using program synthesis to imitate reinforcement learning policies after seeing a trajectory of the action sequence. Programs have the advantage that they are inherently interpretable and verifiable for correctness. We adapt the state-of-the-art program synthesis system DreamCoder for learning concepts in grid-based environments, specifically, a navigation task and two miniature versions of Atari games, Space Invaders and Asterix. By inspecting the generated libraries, we can make inferences about the concepts the black-box agent has learned and better understand the agent's behavior. We achieve the same by visualizing the agent's decision-making process for the imitated sequences. We evaluate our approach with different types of program synthesizers based on a search-only method, a neural-guided search, and a language model fine-tuned on code.

READ FULL TEXT

page 4

page 6

page 7

page 8

page 9

research
02/01/2019

Visual Rationalizations in Deep Reinforcement Learning for Atari Games

Due to the capability of deep learning to perform well in high dimension...
research
02/22/2021

Program Synthesis Guided Reinforcement Learning

A key challenge for reinforcement learning is solving long-horizon plann...
research
07/01/2018

Towards Mixed Optimization for Reinforcement Learning with Program Synthesis

Deep reinforcement learning has led to several recent breakthroughs, tho...
research
12/09/2021

Learning Generalizable Behavior via Visual Rewrite Rules

Though deep reinforcement learning agents have achieved unprecedented su...
research
09/10/2020

TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

In explainable artificial intelligence, there is increasing interest in ...
research
05/07/2020

CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion

Motion synthesis in a dynamic environment has been a long-standing probl...
research
05/20/2018

Unsupervised Video Object Segmentation for Deep Reinforcement Learning

We present a new technique for deep reinforcement learning that automati...

Please sign up or login with your details

Forgot password? Click here to reset