research
∙
05/01/2022
Processing Network Controls via Deep Reinforcement Learning
Novel advanced policy gradient (APG) algorithms, such as proximal policy...
research
∙
07/16/2021
Refined Policy Improvement Bounds for MDPs
The policy improvement bound on the difference of the discounted returns...
research
∙
09/27/2020
Scalable Deep Reinforcement Learning for Ride-Hailing
Ride-hailing services, such as Didi Chuxing, Lyft, and Uber, arrange tho...
research
∙
07/31/2020