We consider local kernel metric learning for off-policy evaluation (OPE)...
Recent works on machine learning for combinatorial optimization have sho...
We consider the offline reinforcement learning (RL) setting where the ag...
Inverse Reinforcement Learning (IRL) aims to facilitate a learner's abil...
Adversarial imitation learning alternates between learning a discriminat...
Multi-agent adversarial inverse reinforcement learning (MA-AIRL) is a re...