Learning from Symmetry: Meta-Reinforcement Learning with Symmetric Data and Language Instructions

09/21/2022
by Xiangtong Yao, et al.

Meta-reinforcement learning (meta-RL) is a promising approach that enables an agent to learn new tasks quickly. However, most meta-RL algorithms generalize poorly in multi-task scenarios because rewards alone provide insufficient task information. Language-conditioned meta-RL improves generalization by matching language instructions to the agent's behaviors. Learning from symmetry is an important form of human learning, so combining symmetry and language instructions in meta-RL can help improve an algorithm's generalization and learning efficiency. We therefore propose a dual-MDP meta-reinforcement learning method that enables new tasks to be learned efficiently from symmetric data and language instructions. We evaluate our method on multiple challenging manipulation tasks, and the experimental results show that it can greatly improve the generalization and efficiency of meta-reinforcement learning.

research
09/11/2022

Meta-Reinforcement Learning via Language Instructions

Although deep reinforcement learning has recently been very successful a...
research
06/15/2017

Zero-Shot Task Generalization with Multi-Task Deep Reinforcement Learning

As a step towards developing zero-shot task generalization capabilities ...
research
12/21/2018

Learning to Navigate the Web

Learning in environments with large state and action spaces, and sparse ...
research
06/14/2023

Simple Embodied Language Learning as a Byproduct of Meta-Reinforcement Learning

Whereas machine learning models typically learn language by directly tra...
research
12/04/2020

Model-Agnostic Learning to Meta-Learn

In this paper, we propose a learning algorithm that enables a model to q...
research
08/01/2023

BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization

Evolutionary reinforcement learning (ERL) algorithms recently raise atte...
research
11/18/2022

Language-Conditioned Reinforcement Learning to Solve Misunderstandings with Action Corrections

Human-to-human conversation is not just talking and listening. It is an ...
