Interestingness Elements for Explainable Reinforcement Learning: Understanding Agents' Capabilities and Limitations

12/19/2019
by   Pedro Sequeira, et al.
15

We propose an explainable reinforcement learning (XRL) framework that analyzes an agent's history of interaction with the environment to extract interestingness elements that help explain its behavior. The framework relies on data readily available from standard RL algorithms, augmented with data that can easily be collected by the agent while learning. We describe how to create visual explanations of an agent's behavior in the form of short video-clips highlighting key interaction moments, based on the proposed elements. We also report on a user study where we evaluated the ability of humans in correctly perceiving the aptitude of agents with different characteristics, including their capabilities and limitations, given explanations automatically generated by our framework. The results show that the diversity of aspects captured by the different interestingness elements is crucial to help humans correctly identify the agents' aptitude in the task, and determine when they might need adjustments to improve their performance.

READ FULL TEXT

page 10

page 13

page 16

page 24

research
11/22/2019

Culture-Based Explainable Human-Agent Deconfliction

Law codes and regulations help organise societies for centuries, and as ...
research
02/24/2023

GANterfactual-RL: Understanding Reinforcement Learning Agents' Strategies through Visual Counterfactual Explanations

Counterfactual explanations are a common tool to explain artificial inte...
research
05/17/2023

Explainable Multi-Agent Reinforcement Learning for Temporal Queries

As multi-agent reinforcement learning (MARL) systems are increasingly de...
research
09/24/2022

Explainable Reinforcement Learning via Model Transforms

Understanding emerging behaviors of reinforcement learning (RL) agents m...
research
11/11/2022

Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning

In recent years, advances in deep learning have resulted in a plethora o...
research
06/07/2015

A Framework for Constrained and Adaptive Behavior-Based Agents

Behavior Trees are commonly used to model agents for robotics and games,...
research
04/25/2023

A Closer Look at Reward Decomposition for High-Level Robotic Explanations

Explaining the behavior of intelligent agents such as robots to humans i...

Please sign up or login with your details

Forgot password? Click here to reset