Explainable Deep Reinforcement Learning: State of the Art and Challenges

01/24/2023
by George A. Vouros, et al.

Interpretability, explainability and transparency are key issues for introducing Artificial Intelligence methods in many critical domains. This is important due to ethical concerns and trust issues strongly connected to reliability, robustness, auditability and fairness, and it has important consequences for keeping the human in the loop at high levels of automation, especially in critical decision-making cases where both the human and the machine play important roles. While the research community has given much attention to the explainability of closed (or black) prediction boxes, there is a pressing need for explainability of closed-box methods that support agents acting autonomously in the real world. Reinforcement learning methods, and especially their deep versions, are such closed-box methods. In this article we aim to provide a review of state-of-the-art methods for explainable deep reinforcement learning, also taking into account the needs of human operators - i.e., of those who take the actual and critical decisions in solving real-world problems. We provide a formal specification of the deep reinforcement learning explainability problems, and we identify the necessary components of a general explainable reinforcement learning framework. Based on these, we provide a comprehensive review of state-of-the-art methods, categorizing them into classes according to the paradigm they follow, the interpretable models they use, and the surface representation of the explanations provided. The article concludes by identifying open questions and important challenges.

research
07/05/2022

Explainability in Deep Reinforcement Learning, a Review into Current Methods and Applications

The use of Deep Reinforcement Learning (DRL) schemes has increased drama...
research
11/12/2020

Domain-Level Explainability – A Challenge for Creating Trust in Superhuman AI Strategies

For strategic problems, intelligent systems based on Deep Reinforcement ...
research
08/15/2020

Explainability in Deep Reinforcement Learning

A large set of the explainable Artificial Intelligence (XAI) literature ...
research
05/11/2022

Knowledge-powered Explainable Artificial Intelligence (XAI) for Network Automation Towards 6G

Communication networks are becoming increasingly complex towards 6G. Man...
research
03/11/2020

Explainable Agents Through Social Cues: A Review

How to provide explanations has experienced a surge of interest in Human...
research
04/02/2021

Explainable Artificial Intelligence (XAI) on TimeSeries Data: A Survey

Most of state of the art methods applied on time series consist of deep ...
research
06/02/2022

HEX: Human-in-the-loop Explainability via Deep Reinforcement Learning

The use of machine learning (ML) models in decision-making contexts, par...
