research ∙ 05/31/2021
A unified view of likelihood ratio and reparameterization gradients
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...
research ∙ 10/14/2019
A unified view of likelihood ratio and reparameterization gradients and an optimal importance sampling scheme
Reparameterization (RP) and likelihood ratio (LR) gradient estimators ar...
research ∙ 02/05/2019
Total stochastic gradient algorithms and applications in reinforcement learning
Backpropagation and the chain rule of derivatives have been prominent; h...
research ∙ 02/04/2019