The advent of large language models trained on code (code LLMs) has led ...
Recent work has shown that fine-tuning large pre-trained language models...
Scaling up language models has led to unprecedented performance gains, b...
Large language models (LLMs) have exhibited remarkable capabilities in l...
Pre-trained masked language models successfully perform few-shot learnin...
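Where the line above mentions few-shot learning with masked language models, a common realization is cloze-style prompting: the task is rewritten as a fill-in-the-blank pattern and label words are scored by the masked LM instead of training a classification head. Below is a minimal sketch using the Hugging Face fill-mask pipeline; the model name, the pattern, and the label words ("great"/"terrible") are illustrative assumptions, not the paper's setup.

from transformers import pipeline

fill = pipeline("fill-mask", model="roberta-base")

review = "The plot was predictable and the acting was wooden."
prompt = f"{review} All in all, it was <mask>."

# Score only the verbalizer words; the higher-scoring one is the label.
scores = {out["token_str"].strip(): out["score"]
          for out in fill(prompt, targets=[" great", " terrible"])}
print(max(scores, key=scores.get))  # expected: "terrible"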
Hate speech detection is complex; it relies on commonsense reasoning, kn...
Prior work on language model pre-training has explored different archite...
All-MLP architectures have attracted increasing interest as an alternati...
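The core idea behind all-MLP models is replacing self-attention with an MLP applied across the sequence dimension. The sketch below follows the MLP-Mixer-style formulation, named explicitly here because the exact architecture behind the truncated line is not shown; all sizes are illustrative assumptions.

import torch
import torch.nn as nn

class MLPBlock(nn.Module):
    def __init__(self, seq_len=128, d_model=512, expansion=4):
        super().__init__()
        self.token_mix = nn.Sequential(    # mixes information across positions
            nn.Linear(seq_len, seq_len * expansion), nn.GELU(),
            nn.Linear(seq_len * expansion, seq_len))
        self.channel_mix = nn.Sequential(  # per-position feed-forward
            nn.Linear(d_model, d_model * expansion), nn.GELU(),
            nn.Linear(d_model * expansion, d_model))
        self.norm1 = nn.LayerNorm(d_model)
        self.norm2 = nn.LayerNorm(d_model)

    def forward(self, x):  # x: (batch, seq_len, d_model)
        # Token mixing operates on the transposed (batch, d_model, seq_len) view.
        x = x + self.token_mix(self.norm1(x).transpose(1, 2)).transpose(1, 2)
        return x + self.channel_mix(self.norm2(x))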
Mixture of Experts layers (MoEs) enable efficient scaling of language mo...
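The efficiency claim rests on conditional computation: an MoE layer holds many expert feed-forward networks but routes each token to only a few of them, so parameter count grows with the number of experts while per-token compute stays roughly constant. A minimal sketch with top-1 gating follows; module names and sizes are illustrative assumptions, not any paper's exact implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F

class MoELayer(nn.Module):
    def __init__(self, d_model=512, d_hidden=2048, n_experts=8):
        super().__init__()
        self.gate = nn.Linear(d_model, n_experts)  # learned router
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.ReLU(),
                          nn.Linear(d_hidden, d_model))
            for _ in range(n_experts))

    def forward(self, x):  # x: (tokens, d_model)
        scores = F.softmax(self.gate(x), dim=-1)  # routing probabilities
        top_p, top_i = scores.max(dim=-1)         # top-1 expert per token
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = top_i == e                     # tokens routed to expert e
            if mask.any():
                # Only the selected expert runs for these tokens, which is
                # why FLOPs per token do not grow with n_experts.
                out[mask] = top_p[mask, None] * expert(x[mask])
        return out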
State-of-the-art natural language understanding classification models fo...
Unsupervised pre-training has led to much recent progress in natural lan...
The structured representation for semantic parsing in task-oriented assi...
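A widely used structured representation for task-oriented semantic parsing is a TOP-style bracketed tree of intents (IN:) and slots (SL:). The utterance, labels, and helper below are illustrative, not necessarily the format the truncated line refers to.

def parse_top(s):
    """Parse a TOP-style bracketed string into a nested (label, children) tree."""
    tokens = s.replace("[", " [ ").replace("]", " ] ").split()
    stack = [("ROOT", [])]
    i = 0
    while i < len(tokens):
        t = tokens[i]
        if t == "[":
            node = (tokens[i + 1], [])   # token after "[" is the IN:/SL: label
            stack[-1][1].append(node)
            stack.append(node)
            i += 2
        elif t == "]":
            stack.pop()
            i += 1
        else:
            stack[-1][1].append(t)       # plain utterance word
            i += 1
    return stack[0][1][0]

parse = "[IN:CREATE_REMINDER [SL:TODO call mom ] [SL:DATE_TIME tomorrow ] ]"
print(parse_top(parse))
# ('IN:CREATE_REMINDER', [('SL:TODO', ['call', 'mom']),
#                         ('SL:DATE_TIME', ['tomorrow'])])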
Online social networks provide a platform for sharing information and fr...
We present BART, a denoising autoencoder for pretraining sequence-to-seq...
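BART's denoising objective corrupts the input text and trains a sequence-to-sequence model to reconstruct the original. One of its noising functions is text infilling: contiguous spans, with lengths drawn from a Poisson(3) distribution, are each replaced by a single mask token. The sketch below illustrates that corruption step only; it clamps spans to length at least one and omits the other noising functions, so it is a simplification rather than the reference implementation.

import numpy as np

rng = np.random.default_rng(0)
MASK = "<mask>"

def text_infill(tokens, mask_ratio=0.3, lam=3.0):
    out, i = [], 0
    budget = int(len(tokens) * mask_ratio)  # total tokens to corrupt
    while i < len(tokens):
        if budget > 0 and rng.random() < mask_ratio:
            span = max(1, min(int(rng.poisson(lam)), budget))  # clamped span
            out.append(MASK)   # the whole span becomes one mask token
            i += span
            budget -= span
        else:
            out.append(tokens[i])
            i += 1
    return out

src = "the quick brown fox jumps over the lazy dog".split()
print(text_infill(src))  # corrupted input; the decoder learns to emit src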