Daniil Tiapkin

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Remi Munos
89 publications
Michal Valko
73 publications
Eric Moulines
63 publications
Yunhao Tang
41 publications
Mark Rowland
40 publications
Pierre Ménard
31 publications
Alexander Gasnikov
25 publications
Denis Belomestny
24 publications
Daniele Calandriello
21 publications
Pavel Dvurechensky
19 publications
Michael Muehlebach
14 publications

research

∙ 04/06/2023

Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

In this work, we derive sharp non-asymptotic deviation bounds for weight...

0 Denis Belomestny, et al. ∙

research

∙ 03/16/2023

Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold

We consider the problem of minimizing a non-convex function over a smoot...

0 Sholom Schechtman, et al. ∙

research

∙ 03/14/2023

Fast Rates for Maximum Entropy Exploration

We consider the reinforcement learning (RL) setting, in which the agent ...

0 Daniil Tiapkin, et al. ∙

research

∙ 09/28/2022

Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees

We consider reinforcement learning in an environment modeled by an episo...

0 Daniil Tiapkin, et al. ∙

research

∙ 05/16/2022

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

We propose the Bayes-UCBVI algorithm for reinforcement learning in tabul...

0 Daniil Tiapkin, et al. ∙

research

∙ 02/27/2021

Parallel Stochastic Mirror Descent for MDPs

We consider the problem of learning the optimal policy for infinite-hori...

0 Daniil Tiapkin, et al. ∙

research

∙ 06/11/2020

Stochastic Saddle-Point Optimization for Wasserstein Barycenters

We study the computation of non-regularized Wasserstein barycenters of p...

0 Daniil Tiapkin, et al. ∙

Success!

An error occurred

Daniil Tiapkin

Featured Co-authors

Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

Orthogonal Directions Constrained Gradient Method: from non-linear equality constraints to Stiefel manifold

Fast Rates for Maximum Entropy Exploration

Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

Parallel Stochastic Mirror Descent for MDPs

Stochastic Saddle-Point Optimization for Wasserstein Barycenters

Sign in with Google

Consider DeepAI Pro