Mutual Information-based State-Control for Intrinsically Motivated Reinforcement Learning

02/05/2020
by   Rui Zhao, et al.

In reinforcement learning, an agent learns to reach a set of goals by means of an external reward signal. In the natural world, intelligent organisms learn from internal drives, bypassing the need for external signals, which is beneficial for a wide range of tasks. Motivated by this observation, we propose to formulate an intrinsic objective as the mutual information between the goal states and the controllable states. This objective encourages the agent to take control of its environment. Subsequently, we derive a surrogate objective of the proposed reward function, which can be optimized efficiently. Lastly, we evaluate the developed framework in different robotic manipulation and navigation tasks and demonstrate the efficacy of our approach. A video showing experimental results is available at <https://youtu.be/CT4CKMWBYz0>.
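As a toy illustration of the quantity this objective is built on, the sketch below computes the mutual information between two discrete variables from a joint probability table. It is not the surrogate objective from the paper. The function name `mutual_information` and the two example tables (standing in for a goal-state variable and a controllable-state variable) are hypothetical and chosen only for illustration.

```python
import math


def mutual_information(joint):
    """Return I(X; Y) in nats for a discrete joint table joint[x][y].

    Hypothetical helper for illustration; rows index one variable
    (e.g. a goal state), columns index the other (e.g. a controllable state).
    """
    px = [sum(row) for row in joint]        # marginal over rows
    py = [sum(col) for col in zip(*joint)]  # marginal over columns
    mi = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0:  # zero-probability cells contribute nothing
                mi += p * math.log(p / (px[i] * py[j]))
    return mi


# Assumed toy distributions: in the first, the two states are independent;
# in the second, one state fully determines the other.
independent = [[0.25, 0.25], [0.25, 0.25]]
coupled = [[0.5, 0.0], [0.0, 0.5]]

print(mutual_information(independent))
print(mutual_information(coupled))
```

In this toy setting the independent table yields zero mutual information and the fully coupled table yields log 2 nats, which matches the intuition that a higher value corresponds to states that are more tightly tied together.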


