Multi-Task Policy Search

07/02/2013
by Marc Peter Deisenroth, et al.

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training an individual policy for every potential task is often impractical, especially when tasks vary continuously, which calls for more principled approaches to sharing and transferring knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, so that a single learned policy generalizes across multiple known and unknown tasks. We apply the approach to reinforcement learning and imitation learning in real-robot experiments.
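
The key idea, a policy that takes the task as an input alongside the state, can be illustrated with a minimal sketch. The radial-basis-function parametrization, the helper name `make_policy`, the dimensions, and the random parameters below are assumptions chosen for illustration only; they are not taken from the paper, and no learning step is shown.

```python
import numpy as np


def make_policy(centers, weights, length_scale=1.0):
    """Build a hypothetical task-conditioned policy pi(state, task).

    The policy is a radial-basis-function network evaluated on the
    concatenated input [state, task], so one parameter set (centers,
    weights) serves every task.
    """
    centers = np.asarray(centers, dtype=float)
    weights = np.asarray(weights, dtype=float)

    def policy(state, task):
        # Augment the state with the task descriptor.
        z = np.concatenate([np.atleast_1d(state), np.atleast_1d(task)]).astype(float)
        # Squared distance from the augmented input to each basis center.
        sq_dist = np.sum((centers - z) ** 2, axis=1)
        features = np.exp(-0.5 * sq_dist / length_scale ** 2)
        # Linear readout of the basis activations gives the control signal.
        return features @ weights

    return policy


# Illustrative sizes and random (untrained) parameters.
rng = np.random.default_rng(0)
state_dim, task_dim, n_basis, action_dim = 2, 1, 5, 1
centers = rng.normal(size=(n_basis, state_dim + task_dim))
weights = rng.normal(size=(n_basis, action_dim))
policy = make_policy(centers, weights)

# Same state, two different task descriptors, one shared policy.
state = np.array([0.1, -0.3])
u_a = policy(state, task=np.array([0.5]))
u_b = policy(state, task=np.array([1.5]))
```

Because the task enters as an ordinary input, the same parameters yield different controls for different tasks, and the policy can be queried at a task value that never appeared during training. How well it performs there depends on how the parameters were learned, which this sketch does not cover.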


