Causal Inference Q-Network: Toward Resilient Reinforcement Learning

02/18/2021
by   Chao-Han Huck Yang, et al.
0

Deep reinforcement learning (DRL) has demonstrated impressive performance in various gaming simulators and real-world applications. In practice, however, a DRL agent may receive faulty observation by abrupt interferences such as black-out, frozen-screen, and adversarial perturbation. How to design a resilient DRL algorithm against these rare but mission-critical and safety-crucial scenarios is an important yet challenging task. In this paper, we consider a generative DRL framework training with an auxiliary task of observational interferences such as artificial noises. Under this framework, we discuss the importance of the causal relation and propose a causal inference based DRL algorithm called causal inference Q-network (CIQ). We evaluate the performance of CIQ in several benchmark DRL environments with different types of interferences as auxiliary labels. Our experimental results show that the proposed CIQ method could achieve higher performance and more resilience against observational interferences.

READ FULL TEXT

page 11

page 14

page 15

page 17

page 18

page 20

page 21

page 23

research
11/28/2022

Causal Deep Reinforcement Learning using Observational Data

Deep reinforcement learning (DRL) requires the collection of plenty of i...
research
11/29/2021

Pessimistic Model Selection for Offline Deep Reinforcement Learning

Deep Reinforcement Learning (DRL) has demonstrated great potentials in s...
research
05/05/2022

A Temporal-Pattern Backdoor Attack to Deep Reinforcement Learning

Deep reinforcement learning (DRL) has made significant achievements in m...
research
03/22/2021

Enhancing the Generalization Performance and Speed Up Training for DRL-based Mapless Navigation

Training an agent to navigate with DRL is data-hungry, which requires mi...
research
03/21/2022

ReCCoVER: Detecting Causal Confusion for Explainable Reinforcement Learning

Despite notable results in various fields over the recent years, deep re...
research
06/22/2020

Provably Efficient Causal Reinforcement Learning with Confounded Observational Data

Empowered by expressive function approximators such as neural networks, ...
research
04/29/2021

Hypernetwork Dismantling via Deep Reinforcement Learning

Network dismantling aims to degrade the connectivity of a network by rem...

Please sign up or login with your details

Forgot password? Click here to reset