Thomas Mesnard

research

∙ 09/01/2023

RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

Reinforcement learning from human feedback (RLHF) is effective at aligni...

0 Harrison Lee, et al. ∙

research

∙ 11/18/2022

Curiosity in hindsight

Consider the exploration in sparse-reward or reward-free environments, s...

0 Daniel Jarrett, et al. ∙

research

∙ 01/06/2021

Geometric Entropic Exploration

Exploration is essential for solving complex Reinforcement Learning (RL)...

0 Zhaohan Daniel Guo, et al. ∙

research

∙ 11/18/2020

Counterfactual Credit Assignment in Model-Free Reinforcement Learning

Credit assignment in reinforcement learning is the problem of measuring ...

8 Thomas Mesnard, et al. ∙

research

∙ 12/05/2019

Hindsight Credit Assignment

We consider the problem of efficient credit assignment in reinforcement ...

0 Anna Harutyunyan, et al. ∙

research

∙ 11/15/2019

Ghost Units Yield Biologically Plausible Backprop in Deep Neural Networks

In the past few years, deep learning has transformed artificial intellig...

37 Thomas Mesnard, et al. ∙

research

∙ 08/14/2018

Generalization of Equilibrium Propagation to Vector Field Dynamics

The biological plausibility of the backpropagation algorithm has long be...

0 Benjamin Scellier, et al. ∙

research

∙ 12/09/2016

Towards deep learning with spiking neurons in energy based models with contrastive Hebbian plasticity

In machine learning, error back-propagation in multi-layer neural networ...

0 Thomas Mesnard, et al. ∙

research

∙ 09/19/2015

STDP as presynaptic activity times rate of change of postsynaptic activity

We introduce a weight update formula that is expressed only in terms of ...

0 Yoshua Bengio, et al. ∙

Thomas Mesnard

Featured Co-authors

Sign in with Google

Consider DeepAI Pro