Learning Causal Models of Autonomous Agents using Interventions

08/21/2021
by   Pulkit Verma, et al.
0

One of the several obstacles in the widespread use of AI systems is the lack of requirements of interpretability that can enable a layperson to ensure the safe and reliable behavior of such systems. We extend the analysis of an agent assessment module that lets an AI system execute high-level instruction sequences in simulators and answer the user queries about its execution of sequences of actions. We show that such a primitive query-response capability is sufficient to efficiently derive a user-interpretable causal model of the system in stationary, fully observable, and deterministic settings. We also introduce dynamic causal decision networks (DCDNs) that capture the causal structure of STRIPS-like domains. A comparative analysis of different classes of queries is also presented in terms of the computational requirements needed to answer them and the efforts required to evaluate their responses to learn the correct model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/11/2023

Causal Abstraction for Faithful Model Interpretation

A faithful and interpretable explanation of an AI model's behavior and i...
research
03/24/2022

Differential Assessment of Black-Box AI Agents

Much of the research on learning symbolic models of AI agents focuses on...
research
07/28/2021

Learning User-Interpretable Descriptions of Black-Box AI System Capabilities

Several approaches have been developed to answer specific questions that...
research
02/12/2020

Resolving Spurious Correlations in Causal Models of Environments via Interventions

Causal models could increase interpretability, robustness to distributio...
research
03/05/2021

Causal Analysis of Agent Behavior for AI Safety

As machine learning systems become more powerful they also become increa...
research
12/01/2021

Inducing Causal Structure for Interpretable Neural Networks

In many areas, we have well-founded insights about causal structure that...

Please sign up or login with your details

Forgot password? Click here to reset