While deep learning models have replaced hand-designed features across m...
We study the problem of efficient generative inference for Transformer m...
Effective scaling and a flexible task interface enable large language mo...
Large language models have been shown to achieve remarkable performance ...
Recent neural network-based language models have benefited greatly from ...
Recent results in language understanding using neural networks have requ...
This paper presents a high-quality multilingual dataset for the document...
Deep learning frameworks have often focused on either usability or speed...
OpenSpiel is a collection of environments and algorithms for research in...
The process of designing neural architectures requires expert knowledge ...
Second-order methods for neural network optimization have several advant...
Existing approaches to neural machine translation condition each output ...
Building models that take advantage of the hierarchical structure of lan...
Computer vision has benefited from initializing multiple deep layers wit...
Recurrent neural networks are a powerful tool for modeling sequential da...
Recent neural network sequence models with softmax classifiers have achi...
Most tasks in natural language processing can be cast into question answ...