research
∙
01/30/2023
A Novel Framework for Policy Mirror Descent with General Parametrization and Linear Convergence
Modern policy optimization methods in applied reinforcement learning, su...
research
∙
09/30/2022
Linear Convergence for Natural Policy Gradient with Log-linear Policy Parametrization
We analyze the convergence rate of the unregularized natural policy grad...
research
∙
09/23/2021