Intrinsic Motivation in Object-Action-Outcome Blending Latent Space

by   Melisa Sener, et al.

One effective approach for equipping artificial agents with sensorimotor skills is to use self-exploration. To do this efficiently is critical as time and data collection are costly. In this study, we propose an exploration mechanism that blends action, object, and action outcome representations into a latent space, where local regions are formed to host forward model learning. The agent uses intrinsic motivation to select the forward model with the highest learning progress to adapt at a given exploration step. This parallels how infants learn, as high learning progress indicates that the learning problem is neither too easy nor too difficult in the selected region. The proposed approach is validated with a simulated robot in a table-top environment. The robot interacts with different kinds of objects using a set of parameterized actions and learns the outcomes of these interactions. With the proposed approach, the robot organizes its own curriculum of learning as in existing intrinsic motivation approaches and outperforms them in terms of learning speed. Moreover, the learning regime demonstrates features that partially match infant development, in particular, the proposed system learns to predict grasp action outcomes earlier than that of push action.


page 1

page 4

page 5


Active choice of teachers, learning strategies and goals for a socially guided intrinsic motivation learner

We present an active learning architecture that allows a robot to active...

Show me the Way: Intrinsic Motivation from Demonstrations

The study of exploration in Reinforcement Learning (RL) has a long histo...

Learning and Acting in Peripersonal Space: Moving, Reaching, and Grasping

The young infant explores its body, its sensorimotor system, and the imm...

Socially Guided Intrinsic Motivation for Robot Learning of Motor Skills

This paper presents a technical approach to robot learning of motor skil...

Boredom-driven curious learning by Homeo-Heterostatic Value Gradients

This paper presents the Homeo-Heterostatic Value Gradients (HHVG) algori...

Learning Transferable Push Manipulation Skills in Novel Contexts

This paper is concerned with learning transferable forward models for pu...

Tracking Emotions: Intrinsic Motivation Grounded on Multi-Level Prediction Error Dynamics

How do cognitive agents decide what is the relevant information to learn...

Please sign up or login with your details

Forgot password? Click here to reset