research ∙ 06/27/2012
On the Sample Complexity of Reinforcement Learning with a Generative Model
We consider the problem of learning the optimal action-value function in...
research ∙ 12/09/2011