We consider the problem of contextual bandits and imitation learning, wh...
We consider the problem of Imitation Learning (IL) by actively querying ...
We introduce camouflaged data poisoning attacks, a new attack vector tha...
We consider the problem of interactive decision making, encompassing
str...
We consider a hybrid reinforcement learning setting (Hybrid RL), in whic...
Stochastic Gradient Descent (SGD) has been the method of choice for lear...
A central problem in online learning and decision making – from bandits ...
We study reinforcement learning with function approximation for large-sc...
We study Reinforcement Learning for partially observable dynamical syste...
Myopic exploration policies such as epsilon-greedy, softmax, or Gaussian...
Multi-epoch, small-batch, Stochastic Gradient Descent (SGD) has been the...
There have been many recent advances on provably efficient Reinforcement...
We investigate the problem of active learning in the streaming setting i...
We study the problem of forgetting datapoints from a learnt model. In th...
We design an algorithm which finds an ϵ-approximate stationary point
(wi...
We study episodic reinforcement learning in Markov decision processes wh...
We give nearly matching upper and lower bounds on the oracle complexity ...
We investigate 1) the rate at which refined properties of the empirical
...
Domain knowledge can often be encoded in the structure of a network, suc...