research ∙ 02/06/2021
A Hybrid Approach for Reinforcement Learning Using Virtual Policy Gradient for Balancing an Inverted Pendulum
Using the policy gradient algorithm, we train a single-hidden-layer neur...