Transformer-based models have achieved state-of-the-art results in many t...
Delusional bias is a fundamental source of error in approximate Q-learni...
Deep Reinforcement Learning (RL) has proven powerful for decision making ...
In batch reinforcement learning (RL), one often constrains a learned pol...
Latent-state environments with long horizons, such as those faced by rec...