Recently, graph-based planning algorithms have gained much attention to ...
Intrinsic rewards have been increasingly used to mitigate the sparse rew...
We propose a novel value-based algorithm for cooperative multi-agent
rei...
We consider the Markov Decision Process (MDP) of selecting a subset of i...
We explore value-based solutions for multi-agent reinforcement learning
...
Many real-world reinforcement learning tasks require multiple agents to ...