Movement primitives are trainable parametric models that reproduce robot...
In classic reinforcement learning algorithms, agents make decisions at
d...
The policy gradient theorem (Sutton et al., 2000) prescribes the usage o...
Policy gradient (PG) estimators for softmax policies are ineffective wit...
Off-policy Reinforcement Learning (RL) holds the promise of better data
...
Parameterized movement primitives have been extensively used for imitati...
Movement primitives are an important policy class for real-world robotic...
The Nadaraya-Watson kernel estimator is among the most popular nonparame...
Reinforcement learning (RL) algorithms still suffer from high sample
com...