Safety-aware Policy Optimisation for Autonomous Racing

10/14/2021
by   Bingqing Chen, et al.
0

To be viable for safety-critical applications, such as autonomous driving and assistive robotics, autonomous agents should adhere to safety constraints throughout the interactions with their environments. Instead of learning about safety by collecting samples, including unsafe ones, methods such as Hamilton-Jacobi (HJ) reachability compute safe sets with theoretical guarantees using models of the system dynamics. However, HJ reachability is not scalable to high-dimensional systems, and the guarantees hinge on the quality of the model. In this work, we inject HJ reachability theory into the constrained Markov decision process (CMDP) framework, as a control-theoretical approach for safety analysis via model-free updates on state-action pairs. Furthermore, we demonstrate that the HJ safety value can be learned directly on vision context, the highest-dimensional problem studied via the method to-date. We evaluate our method on several benchmark tasks, including Safety Gym and Learn-to-Race (L2R), a recently-released high-fidelity autonomous racing environment. Our approach has significantly fewer constraint violations in comparison to other constrained RL baselines, and achieve the new state-of-the-art results on the L2R benchmark task.

READ FULL TEXT

page 7

page 14

page 17

research
02/24/2020

Safe reinforcement learning for probabilistic reachability and safety specifications: A Lyapunov-based approach

Emerging applications in robotics and autonomous systems, such as autono...
research
01/15/2021

Scalable Learning of Safety Guarantees for Autonomous Systems using Hamilton-Jacobi Reachability

Autonomous systems like aircraft and assistive robots often operate in s...
research
06/21/2023

State-wise Constrained Policy Optimization

Reinforcement Learning (RL) algorithms have shown tremendous success in ...
research
11/04/2020

DeepReach: A Deep Learning Approach to High-Dimensional Reachability

Hamilton-Jacobi (HJ) reachability analysis is an important formal verifi...
research
05/05/2022

Learn-to-Race Challenge 2022: Benchmarking Safe Learning and Cross-domain Generalisation in Autonomous Racing

We present the results of our autonomous racing virtual challenge, based...
research
10/19/2022

Provably Safe Reinforcement Learning via Action Projection using Reachability Analysis and Polynomial Zonotopes

While reinforcement learning produces very promising results for many ap...
research
09/29/2022

Parameter-Conditioned Reachable Sets for Updating Safety Assurances Online

Hamilton-Jacobi (HJ) reachability analysis is a powerful tool for analyz...

Please sign up or login with your details

Forgot password? Click here to reset