During the last stage of RLHF, a large language model is aligned to huma...
Long-form question answering (LFQA) enables answering a wide range of qu...
In this paper, we study the generation quality of interpolation-based re...
Evaluating the factuality of long-form text generated by large language ...
Large language models (LLMs) are competitive with the state of the art o...
To detect the deployment of large language models for malicious use case...
A key component of generating text from modern language models (LM) is t...
While human evaluation remains best practice for accurately judging the ...
Retrieval-enhanced language models (LMs), which condition their predicti...
Literary translation is a culturally significant task, but it is bottlen...
While machine translation evaluation metrics based on string overlap (e....
To understand what kinds of linguistic knowledge are encoded by pretrain...
Large-scale, high-quality corpora are critical for advancing research in...
In this paper, we explore the challenging problem of performing a genera...
Given an input sequence (or prefix), modern language models often assign...
Exemplification is a process by which writers explain or clarify a conce...
While numerous architectures for long-range language models (LRLMs) have...
Humanities scholars commonly provide evidence for claims that they make ...
Language models are generally trained on short, truncated input sequence...
Recent text generation research has increasingly focused on open-ended d...
Phrase representations derived from BERT often do not exhibit complex ph...
Despite their recent successes in tackling many NLP tasks, large-scale p...
For over thirty years, researchers have developed and analyzed methods f...
Existing work on tabular representation learning jointly models tables a...
Modeling human mobility has a wide range of applications from urban plan...
While large-scale pretrained language models have significantly improved...
Recent progress in language modeling has been driven not only by advance...
Large Transformer-based language models can aid human authors by suggest...
The task of long-form question answering (LFQA) involves retrieving docu...
Recent studies on Question Answering (QA) and Conversational QA (ConvQA)...
Popular media reflects and reinforces societal biases through the use of...
Modern NLP defines the task of style transfer as modifying the style of ...
Systems for story generation are asked to produce plausible and enjoyabl...
The discrepancy between maximum likelihood estimation (MLE) and task mea...
Conversational search is one of the ultimate goals of information retrie...
Recent advances in NLP demonstrate the effectiveness of training large-s...
Recent work has questioned the importance of the Transformer's multi-hea...
We study the problem of model extraction in natural language processing,...
Sports broadcasters inject drama into play-by-play commentary by buildin...
Conversational question answering (ConvQA) is a simplified but concrete ...
While paragraph embedding models are remarkably effective for downstream...
Standard decoders for neural machine translation autoregressively genera...
The process of knowledge acquisition can be viewed as a question-answer ...
Conversational search is an emerging topic in the information retrieval ...
Literary critics often attempt to uncover meaning in a single work of li...
Quizbowl is a scholastic trivia competition that tests human knowledge a...
We introduce deep inside-outside recursive autoencoders (DIORA), a fully...
We analyze the performance of different sentiment classification models ...
We present QuAC, a dataset for Question Answering in Context that contai...
Methods for learning word sense embeddings represent a single word with ...