RAPid-Learn: A Framework for Learning to Recover for Handling Novelties in Open-World Environments

06/24/2022
by   Shivam Goel, et al.
1

We propose RAPid-Learn: Learning to Recover and Plan Again, a hybrid planning and learning method, to tackle the problem of adapting to sudden and unexpected changes in an agent's environment (i.e., novelties). RAPid-Learn is designed to formulate and solve modifications to a task's Markov Decision Process (MDPs) on-the-fly and is capable of exploiting domain knowledge to learn any new dynamics caused by the environmental changes. It is capable of exploiting the domain knowledge to learn action executors which can be further used to resolve execution impasses, leading to a successful plan execution. This novelty information is reflected in its updated domain model. We demonstrate its efficacy by introducing a wide variety of novelties in a gridworld environment inspired by Minecraft, and compare our algorithm with transfer learning baselines from the literature. Our method is (1) effective even in the presence of multiple novelties, (2) more sample efficient than transfer learning RL baselines, and (3) robust to incomplete model information, as opposed to pure symbolic planning approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2023

Graph schemas as abstractions for transfer learning, inference, and planning

We propose schemas as a model for abstractions that can be used for rapi...
research
12/24/2020

SPOTTER: Extending Symbolic Planning Operators through Targeted Reinforcement Learning

Symbolic planning models allow decision-making agents to sequence action...
research
12/12/2012

Reinforcement Learning with Partially Known World Dynamics

Reinforcement learning would enjoy better success on real-world problems...
research
11/18/2020

Domain Concretization from Examples: Addressing Missing Domain Knowledge via Robust Planning

The assumption of complete domain knowledge is not warranted for robot p...
research
11/01/2020

Semantic Task Planning for Service Robots in Open World

In this paper, we present a planning system based on semantic reasoning ...
research
03/24/2023

Learning to Operate in Open Worlds by Adapting Planning Models

Planning agents are ill-equipped to act in novel situations in which the...
research
11/07/2022

A Transfer Learning Approach for UAV Path Design with Connectivity Outage Constraint

The connectivity-aware path design is crucial in the effective deploymen...

Please sign up or login with your details

Forgot password? Click here to reset