When Do Drivers Concentrate? Attention-based Driver Behavior Modeling With Deep Reinforcement Learning

02/26/2020
by   Xingbo Fu, et al.
0

Driver distraction a significant risk to driving safety. Apart from spatial domain, research on temporal inattention is also necessary. In this paper, we propose an actor-critic method - Attention-based Twin Delayed Deep Deterministic policy gradient (ATD3) algorithm to approximate a driver's action according to observations and measure the driver's attention allocation for consecutive time steps in car-following model. Considering reaction time, we construct the attention mechanism in the actor network to capture temporal dependencies of consecutive observations. In the critic network, we employ Twin Delayed Deep Deterministic policy gradient algorithm (TD3) to address overestimated value estimates persisting in the actor-critic algorithm. We conduct experiments on real-world vehicle trajectory datasets and show that the accuracy of our proposed approach outperforms seven baseline algorithms. Moreover, the results reveal that the attention of the drivers in smooth vehicles is uniformly distributed in previous observations while they keep their attention to recent observations when sudden decreases of relative speeds occur. This study is the first contribution to drivers' temporal attention.

READ FULL TEXT
research
05/17/2021

Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial

This paper provides the details of implementing two important policy gra...
research
12/25/2017

Learning to Run with Actor-Critic Ensemble

We introduce an Actor-Critic Ensemble(ACE) method for improving the perf...
research
10/09/2019

Investigation on the generalization of the Sampled Policy Gradient algorithm

The Sampled Policy Gradient (SPG) algorithm is a new offline actor-criti...
research
01/31/2022

Single Time-scale Actor-critic Method to Solve the Linear Quadratic Regulator with Convergence Guarantees

We propose a single time-scale actor-critic algorithm to solve the linea...
research
10/23/2022

Coupling User Preference with External Rewards to Enable Driver-centered and Resource-aware EV Charging Recommendation

Electric Vehicle (EV) charging recommendation that both accommodates use...
research
03/04/2019

Microscopic Traffic Simulation by Cooperative Multi-agent Deep Reinforcement Learning

Expert human drivers perform actions relying on traffic laws and their p...
research
09/07/2023

Hybrid of representation learning and reinforcement learning for dynamic and complex robotic motion planning

Motion planning is the soul of robot decision making. Classical planning...

Please sign up or login with your details

Forgot password? Click here to reset