Do Androids Dream of Electric Fences? Safety-Aware Reinforcement Learning with Latent Shielding

12/21/2021
by Peter He, et al.

As fledgling reinforcement learning systems make their way into real-world applications, concerns about their safety and robustness have grown with them. In recent years, a variety of approaches have been put forward to address the challenges of safety-aware reinforcement learning; however, these methods often require either that a handcrafted model of the environment be provided beforehand or that the environment be relatively simple and low-dimensional. We present a novel approach to safety-aware deep reinforcement learning in high-dimensional environments called latent shielding. Latent shielding leverages internal representations of the environment learnt by model-based agents to "imagine" future trajectories and avoid those deemed unsafe. We experimentally demonstrate that this approach leads to improved adherence to formally defined safety specifications.
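The idea of imagining future trajectories and avoiding unsafe ones can be illustrated with a toy sketch. This is not the authors' implementation: the integer "latent state", the functions `toy_step`, `toy_policy`, and `toy_violates`, and the fallback rule are all hypothetical stand-ins, assumed here only to show the general shape of a shield that vets candidate actions by rolling a model forward.

```python
from typing import Callable, List, Sequence

State = int   # hypothetical stand-in for a learned latent state
Action = int


def imagine(state: State, first_action: Action,
            policy: Callable[[State], Action],
            step: Callable[[State, Action], State],
            horizon: int) -> List[State]:
    """Roll the (assumed) dynamics model forward for `horizon` steps:
    take `first_action`, then follow `policy`."""
    trajectory = []
    action = first_action
    for _ in range(horizon):
        state = step(state, action)
        trajectory.append(state)
        action = policy(state)
    return trajectory


def shielded_action(state: State, candidates: Sequence[Action],
                    policy: Callable[[State], Action],
                    step: Callable[[State, Action], State],
                    violates: Callable[[State], bool],
                    horizon: int) -> Action:
    """Return the first candidate whose imagined trajectory contains no
    predicted violation. If none is fully safe, fall back (an assumption of
    this sketch) to the candidate with the fewest predicted violations."""
    best_action, best_count = None, None
    for action in candidates:
        imagined = imagine(state, action, policy, step, horizon)
        count = sum(violates(s) for s in imagined)
        if count == 0:
            return action
        if best_count is None or count < best_count:
            best_action, best_count = action, count
    if best_action is None:
        raise ValueError("no candidate actions supplied")
    return best_action


# Toy world: positions on a line, with an "electric fence" at position 5.
def toy_step(state: State, action: Action) -> State:
    return state + action


def toy_policy(state: State) -> Action:
    return -1  # after the first action, the agent backs away from the fence


def toy_violates(state: State) -> bool:
    return state >= 5


# From position 3, moving by 2 is imagined to touch the fence, so the shield
# picks the next candidate, 1, whose imagined trajectory stays safe.
print(shielded_action(3, [2, 1, -1], toy_policy, toy_step, toy_violates, horizon=3))
```

In a real agent the step function and violation predictor would be learnt components operating on high-dimensional latent states rather than hand-written rules; the sketch fixes them by hand only so that the control flow is easy to follow.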


