Context-Aware Safe Reinforcement Learning for Non-Stationary Environments

01/02/2021
by   Baiming Chen, et al.
0

Safety is a critical concern when deploying reinforcement learning agents for realistic tasks. Recently, safe reinforcement learning algorithms have been developed to optimize the agent's performance while avoiding violations of safety constraints. However, few studies have addressed the non-stationary disturbances in the environments, which may cause catastrophic outcomes. In this paper, we propose the context-aware safe reinforcement learning (CASRL) method, a meta-learning framework to realize safe adaptation in non-stationary environments. We use a probabilistic latent variable model to achieve fast inference of the posterior environment transition distribution given the context data. Safety constraints are then evaluated with uncertainty-aware trajectory sampling. The high cost of safety violations leads to the rareness of unsafe records in the dataset. We address this issue by enabling prioritized sampling during model training and formulating prior safety constraints with domain knowledge during constrained planning. The algorithm is evaluated in realistic safety-critical environments with non-stationary disturbances. Results show that the proposed algorithm significantly outperforms existing baselines in terms of safety and robustness.

READ FULL TEXT

page 1

page 7

research
11/21/2020

Double Meta-Learning for Data Efficient Policy Optimization in Non-Stationary Environments

We are interested in learning models of non-stationary environments, whi...
research
09/05/2023

Neurosymbolic Meta-Reinforcement Lookahead Learning Achieves Safe Self-Driving in Non-Stationary Environments

In the area of learning-driven artificial intelligence advancement, the ...
research
08/15/2020

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Reinforcement learning (RL) in real-world safety-critical target setting...
research
04/23/2023

System III: Learning with Domain Knowledge for Safety Constraints

Reinforcement learning agents naturally learn from extensive exploration...
research
04/25/2023

Real-time Safety Assessment of Dynamic Systems in Non-stationary Environments: A Review of Methods and Techniques

Real-time safety assessment (RTSA) of dynamic systems is a critical task...
research
04/21/2020

Identification of Compliant Contact Parameters and Admittance Force Modulation on a Non-stationary Compliant Surface

Although autonomous control of robotic manipulators has been studied for...
research
10/23/2020

Towards Safe Policy Improvement for Non-Stationary MDPs

Many real-world sequential decision-making problems involve critical sys...

Please sign up or login with your details

Forgot password? Click here to reset