Value function approximation is important in modern reinforcement learni...
Goal-conditioned reinforcement learning (GCRL) refers to learning genera...
We propose A-Crab (Actor-Critic Regularized by Average Bellman error), a...
Offline reinforcement learning (RL), which refers to decision-making fro...
We study statistical problems, such as planted clique, its variants, and...
We consider the general problem of learning about a matrix through vecto...
Across many areas, from neural tracking to database entity resolution, m...
Dialog policy decides what and how a task-oriented dialog system will re...