Guided Policy Search Model-based Reinforcement Learning for Urban Autonomous Driving

05/06/2020
by Zhuo Xu, et al.

In this paper, we continue our prior work on using imitation learning (IL) and model-free reinforcement learning (RL) to learn driving policies for urban autonomous driving, and introduce a model-based RL method to drive the autonomous vehicle in the CARLA urban driving simulator. Although IL and model-free RL methods have proven capable of solving many challenging tasks, including video games, robotic control, and, in our prior work, urban driving, their low sample efficiency greatly limits their application to real autonomous driving. In this work, we develop a model-based RL algorithm based on guided policy search (GPS) for urban driving tasks. The algorithm iteratively learns a parameterized dynamic model that approximates the complex and interactive driving task, and optimizes the driving policy under this nonlinear approximate dynamic model. As a model-based RL approach applied to urban autonomous driving, GPS offers higher sample efficiency, better interpretability, and greater stability. We provide extensive experiments validating that the proposed method learns robust driving policies for urban driving in CARLA. We also compare the proposed method with other policy search and model-free RL baselines, showing that the GPS-based RL method is 100x more sample efficient and that it can learn policies for harder tasks that the baseline methods can hardly learn.
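The iterative structure described in the abstract (collect data, fit a dynamics model, optimize the policy under that model, repeat) can be illustrated with a small sketch. This is not the authors' GPS implementation and does not use CARLA: it assumes a toy double-integrator system, swaps in a linear least-squares fit for the paper's parameterized dynamic model, and swaps in an LQR gain for the policy optimization step. All function names and constants below are hypothetical and chosen for illustration only.

```python
import numpy as np

def rollout(A_true, B_true, K, rng, horizon=30, explore_std=0.5):
    """Run one episode on the hidden true system with u = -K x + exploration noise."""
    n, m = B_true.shape
    x = rng.normal(size=n)
    X, U, Xn = [], [], []
    for _ in range(horizon):
        u = -K @ x + explore_std * rng.normal(size=m)
        x_next = A_true @ x + B_true @ u + 0.01 * rng.normal(size=n)
        X.append(x)
        U.append(u)
        Xn.append(x_next)
        x = x_next
    return np.array(X), np.array(U), np.array(Xn)

def fit_linear_model(X, U, Xn):
    """Least-squares fit of x_next ~ A x + B u (stand-in for a learned dynamics model)."""
    Z = np.hstack([X, U])                        # shape (N, n + m)
    W, *_ = np.linalg.lstsq(Z, Xn, rcond=None)   # shape (n + m, n)
    n = X.shape[1]
    return W[:n].T, W[n:].T                      # A_hat (n, n), B_hat (n, m)

def lqr_gain(A, B, Q, R, iters=200):
    """Discrete-time LQR gain via fixed-point iteration on the Riccati equation."""
    P = Q.copy()
    for _ in range(iters):
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
    return K

def model_based_loop(n_iters=5, seed=0):
    """Alternate between data collection, model fitting, and policy optimization."""
    rng = np.random.default_rng(seed)
    # Assumed toy system (double integrator), unknown to the learner.
    A_true = np.array([[1.0, 0.1], [0.0, 1.0]])
    B_true = np.array([[0.0], [0.1]])
    Q, R = np.eye(2), 0.1 * np.eye(1)
    K = np.zeros((1, 2))                         # initial policy: exploration noise only
    data = [np.empty((0, 2)), np.empty((0, 1)), np.empty((0, 2))]
    for _ in range(n_iters):
        new = rollout(A_true, B_true, K, rng)    # 1. collect data with current policy
        data = [np.vstack([d, s]) for d, s in zip(data, new)]
        A_hat, B_hat = fit_linear_model(*data)   # 2. refit the dynamics model
        K = lqr_gain(A_hat, B_hat, Q, R)         # 3. optimize policy under the model
    return A_hat, B_hat, K, A_true, B_true
```

In this sketch, each iteration reuses all previously collected transitions when refitting the model, which is one simple way a model-based loop can get more out of each sample than a model-free method. The linear model and LQR step are deliberate simplifications; the paper's setting involves a nonlinear approximate model and a richer driving policy.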


