Transformers have become the dominant model in deep learning, but the re...
Recent architectural developments have enabled recurrent neural networks...
Transformers have become the state-of-the-art neural network architectur...
Identifying unfamiliar inputs, also known as out-of-distribution (OOD)
d...
Neural networks trained with stochastic gradient descent (SGD) starting ...
Equilibrium systems are a powerful way to express neural computations. A...
Finding neural network weights that generalize well from small datasets ...
Meta-learning algorithms leverage regularities that are present on a set...
Continual Learning (CL) algorithms have recently received a lot of atten...
Averaging the predictions of many independently trained neural networks ...
The last decade has seen a surge of interest in continual learning (CL),...
Artificial neural networks suffer from catastrophic forgetting when they...