Sequence models based on linear state spaces (SSMs) have recently emerge...
Leveraging transfer learning has recently been shown to be an effective
...
Understanding the fundamental mechanism behind the success of transforme...
State space models have shown to be effective at modeling long range
dep...
Differential Privacy (DP) provides a formal framework for training machi...
We present ALX, an open-source library for distributed matrix factorizat...
We consider non-convex stochastic optimization using first-order algorit...
We construct an experimental setup in which changing the scale of
initia...
We provide an improved analysis of normalized SGD showing that adding
mo...
The Touchdown dataset (Chen et al., 2019) provides instructions by human...
VALAN is a lightweight and scalable software framework for deep reinforc...
Vision-and-Language Navigation (VLN) tasks such as Room-to-Room (R2R) re...
Vision-and-Language Navigation (VLN) is a natural language grounding tas...