We propose a novel model-based offline Reinforcement Learning (RL) frame...
We propose a new model-based offline RL framework, called Adversarial Mo...
Coverage conditions – which assert that the data logging distribution
ad...
Consider the problem setting of Interaction-Grounded Learning (IGL), in ...
We propose Adversarially Trained Actor Critic (ATAC), a new model-free
a...
The use of pessimism, when reasoning about datasets lacking exhaustive
e...
Recent theoretical work studies sample-efficient reinforcement learning ...
Consider a prosthetic arm, learning to adapt to its user's control signa...
We offer a theoretical characterization of off-policy evaluation (OPE) i...
Recently, Wang et al. (2020) showed a highly intriguing hardness result ...
We solve a long-standing problem in batch reinforcement learning (RL):
l...
We prove performance guarantees of two algorithms for approximating Q^
i...
Motivated by the many real-world applications of reinforcement learning ...
We take initial steps in studying PAC-MDP algorithms with limited adapti...
Many reinforcement learning applications involve the use of data that is...
Risk management in dynamic decision problems is a primary concern in man...