We consider the problem of multi-task reasoning (MTR), where an agent ca...
Risk-averse problems receive far less attention than risk-neutral contro...
Human-robot interactive decision-making is increasingly becoming ubiquit...
Off-policy policy optimization is a challenging problem in reinforcement...
Policy evaluation algorithms are essential to reinforcement learning due...
Recent successes of Reinforcement Learning (RL) allow an agent to learn
...
Conventional reinforcement learning (RL) allows an agent to learn polici...
Deep reinforcement learning (DRL) algorithms have achieved great success...
Deep reinforcement learning (DRL) has gained great success by learning
d...
Deep reinforcement learning (DRL) has gained great success by learning
d...
Risk management in dynamic decision problems is a primary concern in man...
Reinforcement learning and symbolic planning have both been used to buil...
Temporal difference learning and Residual Gradient methods are the most
...