Karl Cobbe

Featured Co-authors

Adrien Gaidon
70 publications
Ilya Sutskever
53 publications
John Schulman
31 publications
Jan Leike
27 publications
Matthew Hausknecht
23 publications
Taehoon Kim
17 publications
Andrey Kolobov
16 publications
Xiaocheng Tang
13 publications
Sharada Mohanty
11 publications
Jacob Hilton
11 publications
Gretchen Krueger
9 publications

research

∙ 05/31/2023

Let's Verify Step by Step

In recent years, large language models have greatly improved in their ab...

1 Hunter Lightman, et al. ∙

research

∙ 12/17/2021

WebGPT: Browser-assisted question-answering with human feedback

We fine-tune GPT-3 to answer long-form questions using a text-based web-...

0 Reiichiro Nakano, et al. ∙

research

∙ 10/27/2021

Training Verifiers to Solve Math Word Problems

State-of-the-art language models can match human performance on many tas...

0 Karl Cobbe, et al. ∙

research

∙ 10/01/2021

Batch size-invariance for policy optimization

We say an algorithm is batch size-invariant if changes to the batch size...

21 Jacob Hilton, et al. ∙

research

∙ 03/29/2021

Measuring Sample Efficiency and Generalization in Reinforcement Learning Benchmarks: NeurIPS 2020 Procgen Benchmark

The NeurIPS 2020 Procgen Competition was designed as a centralized bench...

26 Sharada Mohanty, et al. ∙

research

∙ 09/09/2020

Phasic Policy Gradient

We introduce Phasic Policy Gradient (PPG), a reinforcement learning fram...

0 Karl Cobbe, et al. ∙

research

∙ 12/03/2019

Leveraging Procedural Generation to Benchmark Reinforcement Learning

In this report, we introduce Procgen Benchmark, a suite of 16 procedural...

0 Karl Cobbe, et al. ∙

research

∙ 12/06/2018

Quantifying Generalization in Reinforcement Learning

In this paper, we investigate the problem of overfitting in deep reinfor...

14 Karl Cobbe, et al. ∙
