Reinforcement Learning in Non-Markovian Environments

11/03/2022
by   Siddharth Chandak, et al.
0

Following the novel paradigm developed by Van Roy and coauthors for reinforcement learning in arbitrary non-Markovian environments, we propose a related formulation inspired by classical stochastic control that reduces the problem to recursive computation of approximate sufficient statistics.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset