On the convergence of optimistic policy iteration for stochastic shortest path problem

08/27/2018
by   Yuanlong Chen, et al.
0

In this paper, we prove some convergence results of a special case of optimistic policy iteration algorithm for stochastic shortest path problem. We consider the Monte Carlo and TD(λ) methods for the policy evaluation step under the condition that the termination state will be reached almost surely.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset
Success!
Error Icon An error occurred

Sign in with Google

×

Use your Google Account to sign in to DeepAI

×

Consider DeepAI Pro