research
∙
03/01/2023
The Point to Which Soft Actor-Critic Converges
Soft actor-critic is a successful successor over soft Q-learning. While ...
research
∙
02/01/2023
Distillation Policy Optimization
On-policy algorithms are supposed to be stable, however, sample-intensiv...
research
∙
08/19/2022