LTL2Action: Generalizing LTL Instructions for Multi-Task RL

02/13/2021
by   Pashootan Vaezipoor, et al.
11

We address the problem of teaching a deep reinforcement learning (RL) agent to follow instructions in multi-task environments. The combinatorial task sets we target consist of up to 10^39 unique tasks. We employ a well-known formal language – linear temporal logic (LTL) – to specify instructions, using a domain-specific vocabulary. We propose a novel approach to learning that exploits the compositional syntax and the semantics of LTL, enabling our RL agent to learn task-conditioned policies that generalize to new instructions, not observed during training. The expressive power of LTL supports the specification of a diversity of complex temporally extended behaviours that include conditionals and alternative realizations. To reduce the overhead of learning LTL semantics, we introduce an environment-agnostic LTL pretraining scheme which improves sample-efficiency in downstream environments. Experiments on discrete and continuous domains demonstrate the strength of our approach in learning to solve (unseen) tasks, given LTL instructions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2022

Learning to Follow Instructions in Text-Based Games

Text-based games present a unique class of sequential decision making pr...
research
04/07/2022

A Framework for Following Temporal Logic Instructions with Unknown Causal Dependencies

Teaching a deep reinforcement learning (RL) agent to follow instructions...
research
09/06/2023

Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

Reinforcement learning (RL) with linear temporal logic (LTL) objectives ...
research
10/09/2021

Learning to Follow Language Instructions with Compositional Policies

We propose a framework that learns to execute natural language instructi...
research
12/21/2018

Learning to Navigate the Web

Learning in environments with large state and action spaces, and sparse ...
research
10/18/2021

In a Nutshell, the Human Asked for This: Latent Goals for Following Temporal Specifications

We address the problem of building agents whose goal is to satisfy out-o...
research
01/26/2022

Learning Invariable Semantical Representation from Language for Extensible Policy Generalization

Recently, incorporating natural language instructions into reinforcement...

Please sign up or login with your details

Forgot password? Click here to reset