Model-Based Safe Policy Search from Signal Temporal Logic Specifications Using Recurrent Neural Networks

03/29/2021
by   Wenliang Liu, et al.
0

We propose a policy search approach to learn controllers from specifications given as Signal Temporal Logic (STL) formulae. The system model is unknown, and it is learned together with the control policy. The model is implemented as a feedforward neural network (FNN). To capture the history dependency of the STL specification, we use a recurrent neural network (RNN) to implement the control policy. In contrast to prevalent model-free methods, the learning approach proposed here takes advantage of the learned model and is more efficient. We use control barrier functions (CBFs) with the learned model to improve the safety of the system. We validate our algorithm via simulations. The results show that our approach can satisfy the given specification within very few system runs, and therefore it has the potential to be used for on-line control.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/24/2020

Recurrent Neural Network Controllers for Signal Temporal Logic Specifications Subject to Safety Constraints

We propose a framework based on Recurrent Neural Networks (RNNs) to dete...
research
09/10/2023

Signal Temporal Logic Neural Predictive Control

Ensuring safety and meeting temporal specifications are critical challen...
research
12/10/2022

Neural Controller Synthesis for Signal Temporal Logic Specifications Using Encoder-Decoder Structured Networks

In this paper, we propose a control synthesis method for signal temporal...
research
12/23/2019

Learning an Interpretable Traffic Signal Control Policy

Signalized intersections are managed by controllers that assign right of...
research
09/16/2021

Automated Testing with Temporal Logic Specifications for Robotic Controllers using Adaptive Experiment Design

Many robot control scenarios involve assessing system robustness against...
research
03/23/2019

Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions

Using reinforcement learning to learn control policies is a challenge wh...
research
06/16/2023

Data-Driven Model Discrimination of Switched Nonlinear Systems with Temporal Logic Inference

This paper addresses the problem of data-driven model discrimination for...

Please sign up or login with your details

Forgot password? Click here to reset