Multi-task Representation Learning with Stochastic Linear Bandits

02/21/2022
by   Leonardo Cella, et al.
0

We study the problem of transfer-learning in the setting of stochastic linear bandit tasks. We consider that a low dimensional linear representation is shared across the tasks, and study the benefit of learning this representation in the multi-task learning setting. Following recent results to design stochastic bandit policies, we propose an efficient greedy policy based on trace norm regularization. It implicitly learns a low dimensional representation by encouraging the matrix formed by the task regression vectors to be of low rank. Unlike previous work in the literature, our policy does not need to know the rank of the underlying matrix. We derive an upper bound on the multi-task regret of our policy, which is, up to logarithmic factors, of order √(NdT(T+d)r), where T is the number of tasks, r the rank, d the number of variables and N the number of rounds per task. We show the benefit of our strategy compared to the baseline Td√(N) obtained by solving each task independently. We also provide a lower bound to the multi-task regret. Finally, we corroborate our theoretical findings with preliminary experiments on synthetic data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/29/2022

Nearly Minimax Algorithms for Linear Bandits with Shared Representation

We give novel algorithms for multi-task and lifelong linear bandits with...
research
05/30/2022

Meta Representation Learning with Contextual Linear Bandits

Meta-learning seeks to build algorithms that rapidly learn how to solve ...
research
12/09/2022

Multi-Task Off-Policy Learning from Bandit Feedback

Many practical applications, such as recommender systems and learning to...
research
02/14/2022

Trace norm regularization for multi-task learning with scarce data

Multi-task learning leverages structural similarities between multiple t...
research
06/17/2022

Thompson Sampling for Robust Transfer in Multi-Task Bandits

We study the problem of online multi-task learning where the tasks are p...
research
06/24/2022

Joint Representation Training in Sequential Tasks with Shared Structure

Classical theory in reinforcement learning (RL) predominantly focuses on...
research
08/31/2010

Union Support Recovery in Multi-task Learning

We sharply characterize the performance of different penalization scheme...

Please sign up or login with your details

Forgot password? Click here to reset