Inspired by recent work demonstrating the promise of smaller Transformer...
Language models can be prompted to reason through problems in a manner t...
As the capabilities of large machine learning models continue to grow, a...
We present FACADE, a novel probabilistic and geometric framework designe...
Generative Pre-trained Transformer (GPT) models have exhibited exciting ...
Recent work claims that large language models display emergent abilities...
Double descent is a surprising phenomenon in machine learning, in which ...
Learning from a continuous stream of non-stationary data in an unsupervi...
Humans sometimes choose actions that they themselves can identify as sub...