Zuyue Fu | DeepAI

Chat Image Generator Video Music Voice Chat Photo Editor

Featured Co-authors

Yang Liu
344 publications
Zhaoran Wang
121 publications
Zhuoran Yang
110 publications
Jing Jiang
95 publications
Mladen Kolar
63 publications
Yongxin Chen
45 publications
Michael R. Kosorok
36 publications
Lingxiao Wang
34 publications
Lan Wang
26 publications
Zhengling Qi
24 publications
Yufeng Zhang
23 publications

research

∙ 12/23/2022

Offline Reinforcement Learning for Human-Guided Human-Machine Interaction with Private Information

Motivated by the human-machine interaction such as training chatbots for...

0 Zuyue Fu, et al. ∙

research

∙ 09/18/2022

Offline Reinforcement Learning with Instrumental Variables in Confounded Markov Decision Processes

We study the offline reinforcement learning (RL) in the face of unmeasur...

5 Zuyue Fu, et al. ∙

research

∙ 10/24/2021

SCORE: Spurious COrrelation REduction for Offline Reinforcement Learning

Offline reinforcement learning (RL) aims to learn the optimal policy fro...

0 Zhihong Deng, et al. ∙

research

∙ 08/19/2021

Provably Efficient Generative Adversarial Imitation Learning for Online and Offline Setting with Linear Function Approximation

In generative adversarial imitation learning (GAIL), the agent aims to l...

0 Zhihan Liu, et al. ∙

research

∙ 02/19/2021

Instrumental Variable Value Iteration for Causal Offline Reinforcement Learning

In offline reinforcement learning (RL) an optimal policy is learnt solel...

1 Luofeng Liao, et al. ∙

research

∙ 08/02/2020

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy

We study the global convergence and global optimality of actor-critic, o...

0 Zuyue Fu, et al. ∙

research

∙ 10/16/2019

Actor-Critic Provably Finds Nash Equilibria of Linear-Quadratic Mean-Field Games

We study discrete-time mean-field Markov games with infinite numbers of ...

0 Zuyue Fu, et al. ∙

research

∙ 10/08/2019

Credible Sample Elicitation by Deep Learning, for Deep Learning

It is important to collect credible training samples (x,y) for building ...

0 Yang Liu, et al. ∙

Success!

An error occurred