Hanlin Zhu | DeepAI

DeepAI

AI Chat AI Image Generator AI Video AI Music Voice Chat AI Photo Editor Math AI

Featured Co-authors

Minlie Huang
138 publications
David P. Woodruff
116 publications
Jason D. Lee
100 publications
Stuart Russell
64 publications
Amy Zhang
52 publications
Jiantao Jiao
46 publications
Xue Li
40 publications
Ruosong Wang
33 publications
Cyrus Rashtchian
20 publications
Fei He
18 publications
Peng Ye
18 publications

research

∙ 02/22/2023

Provably Efficient Reinforcement Learning via Surprise Bound

Value function approximation is important in modern reinforcement learni...

0 Hanlin Zhu, et al. ∙

research

∙ 02/07/2023

Provably Efficient Offline Goal-Conditioned Reinforcement Learning with General Function Approximation and Single-Policy Concentrability

Goal-conditioned reinforcement learning (GCRL) refers to learning genera...

0 Hanlin Zhu, et al. ∙

research

∙ 01/30/2023

Importance Weighted Actor-Critic for Optimal Conservative Offline Reinforcement Learning

We propose A-Crab (Actor-Critic Regularized by Average Bellman error), a...

0 Hanlin Zhu, et al. ∙

research

∙ 11/01/2022

Optimal Conservative Offline RL with General Function Approximation via Augmented Lagrangian

Offline reinforcement learning (RL), which refers to decision-making fro...

0 Paria Rashidinejad, et al. ∙

research

∙ 07/03/2021

Average-Case Communication Complexity of Statistical Problems

We study statistical problems, such as planted clique, its variants, and...

0 Cyrus Rashtchian, et al. ∙

research

∙ 06/24/2020

Vector-Matrix-Vector Queries for Solving Linear Algebra, Statistics, and Graph Problems

We consider the general problem of learning about a matrix through vecto...

0 Cyrus Rashtchian, et al. ∙

research

∙ 03/19/2020

Clustering with Fast, Automated and Reproducible assessment applied to longitudinal neural tracking

Across many areas, from neural tracking to database entity resolution, m...

14 Hanlin Zhu, et al. ∙

research

∙ 08/28/2019

Guided Dialog Policy Learning: Reward Estimation for Multi-Domain Task-Oriented Dialog

Dialog policy decides what and how a task-oriented dialog system will re...

0 Ryuichi Takanobu, et al. ∙