The current standard approach to scaling transformer language models tra...
We propose PIGLeT: a model that learns physical commonsense knowledge th...
Shallow syntax provides an approximation of phrase-syntactic structure o...
Contextual word representations derived from large-scale neural language...
While most previous work has focused on different pretraining objectives...
We revisit domain adaptation for parsers in the neural era. First we sho...
We describe a deployed scalable system for organizing published scientif...
This paper describes AllenNLP, a platform for research on deep learning
...