Joshua Susskind | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Jian Zhang
226 publications
Ruslan Salakhutdinov
194 publications
Yue Wu
122 publications
Samy Bengio
55 publications
Navdeep Jaitly
39 publications
Emmanuel Abbe
37 publications
Chen Huang
30 publications
Shuangfei Zhai
28 publications
Tatiana Likhomanenko
26 publications
Jason Ramapuram
18 publications
Enric Boix-Adserà
16 publications

research

∙ 06/12/2023

Transformers learn through gradual rank increase

We identify incremental learning dynamics in transformers, where the dif...

0 Enric Boix-Adserà, et al. ∙

research

∙ 07/15/2022

Position Prediction as an Effective Pretraining Strategy

Transformers have gained increasing popularity in a wide range of applic...

1 Shuangfei Zhai, et al. ∙

research

∙ 06/10/2022

The Slingshot Mechanism: An Empirical Study of Adaptive Optimizers and the Grokking Phenomenon

The grokking phenomenon as reported by Power et al. ( arXiv:2201.02177 )...

13 Vimal Thilak, et al. ∙

research

∙ 01/28/2022

Efficient Embedding of Semantic Similarity in Control Policies via Entangled Bisimulation

Learning generalizeable policies from visual input in the presence of vi...

0 Martin Bertran, et al. ∙

research

∙ 05/17/2021

Uncertainty Weighted Actor-Critic for Offline Reinforcement Learning

Offline Reinforcement Learning promises to learn effective policies from...

18 Yue Wu, et al. ∙

research

∙ 06/13/2020

Collegial Ensembles

Modern neural network performance typically improves as model size incre...

0 Etai Littwin, et al. ∙