Interactive Learning of Environment Dynamics for Sequential Tasks

07/19/2019
by Robert Loftin, et al.

For robots and other artificial agents to efficiently learn useful tasks defined by an end user, they must understand not only the goals of those tasks but also the structure and dynamics of that user's environment. While existing work has examined how the goals of a task can be inferred from a human teacher, the agent is often left to learn about the environment on its own. To address this limitation, we develop an algorithm, Behavior Aware Modeling (BAM), which incorporates a teacher's knowledge into a model of the transition dynamics of an agent's environment. We evaluate BAM both in simulation and with real human teachers, learning from a combination of task demonstrations and evaluative feedback, and show that it can outperform approaches that do not explicitly consider this source of dynamics knowledge.
