Anca Dragan

research

∙ 07/31/2023

Learning to Model the World with Language

To interact with humans in the world, agents need to understand the dive...

Jessy Lin, et al.

research

∙ 07/27/2023

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Reinforcement learning from human feedback (RLHF) is a technique for tra...

Stephen Casper, et al.

research

∙ 06/30/2023

Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control

Our goal is for robots to follow natural language instructions like "put...

Vivek Myers, et al.

research

∙ 06/14/2023

Toward Grounded Social Reasoning

Consider a robot tasked with tidying a desk with a meticulously construc...

Minae Kwon, et al.

research

∙ 04/19/2023

Bridging RL Theory and Practice with the Effective Horizon

Deep reinforcement learning (RL) works impressively in some environments...

Cassidy Laidlaw, et al.

research

∙ 03/08/2023

Automatically Auditing Large Language Models via Discrete Optimization

Auditing large language models for unexpected behaviors is critical to p...

Erik Jones, et al.

research

∙ 03/03/2023

Learning to Influence Human Behavior with Offline Reinforcement Learning

In the real world, some of the most complex settings for learned agents ...

Joey Hong, et al.

research

∙ 01/02/2023

Towards Modeling and Influencing the Dynamics of Human Learning

Humans have internal models of robots (like their physical capabilities)...

Ran Tian, et al.

research

∙ 12/09/2022

On the Sensitivity of Reward Inference to Misspecified Human Models

Inferring reward functions from human behavior is at the center of value...

Joey Hong, et al.

research

∙ 11/30/2022

Time-Efficient Reward Learning via Visually Assisted Cluster Ranking

One of the most successful paradigms for reward learning uses human feed...

David Zhang, et al.

research

∙ 11/20/2022

UniMASK: Unified Inference in Sequential Decision Problems

Randomly masking and predicting word tokens has been a successful approa...

Micah Carroll, et al.

research

∙ 11/03/2022

Optimal Behavior Prior: Data-Efficient Human Models for Improved Human-AI Collaboration

AI agents designed to collaborate with people benefit from models that e...

Mesut Yang, et al.

research

∙ 04/28/2022

Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers

Randomly masking and predicting word tokens has been a successful approa...

Micah Carroll, et al.

research

∙ 04/25/2022

Estimating and Penalizing Induced Preference Shifts in Recommender Systems

The content that a recommender system (RS) shows to users influences the...

Micah Carroll, et al.

research

∙ 04/22/2022

The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models

Models of human behavior for prediction and collaboration tend to fall i...

Cassidy Laidlaw, et al.

research

∙ 04/05/2022

Inferring Rewards from Language in Context

In classic instruction following, language like "I'd like the JetBlue fl...

Jessy Lin, et al.

research

∙ 11/12/2021

Human irrationality: both bad and good for reward inference

Assuming humans are (approximately) rational enables robots to infer rew...

Lawrence Chan, et al.

research

∙ 11/04/2021

B-Pref: Benchmarking Preference-Based Reinforcement Learning

Reinforcement learning (RL) requires access to a reward function that in...

Kimin Lee, et al.

research

∙ 07/05/2021

The MineRL BASALT Competition on Learning from Human Feedback

The last decade has seen a significant increase of interest in deep lear...

Rohin Shah, et al.

research

∙ 04/08/2021

Learning What To Do by Simulating the Past

Since reward functions are hard to specify, recent work has focused on l...

David Lindner, et al.

research

∙ 01/19/2021

Choice Set Misspecification in Reward Inference

Specifying reward functions for robots that operate in environments with...

Rachel Freedman, et al.

research

∙ 06/26/2020

AvE: Assistance via Empowerment

One difficulty in using artificial agents for human-assistive applicatio...

Yuqing Du, et al.

research

∙ 10/13/2019

On the Utility of Learning about Humans for Human-AI Coordination

While we would like agents that can coordinate with humans, current algo...

Micah Carroll, et al.

research

∙ 02/12/2019

Preferences Implicit in the State of the World

Reinforcement learning (RL) agents optimize only the features specified ...

Rohin Shah, et al.

research

∙ 01/24/2019

The Assistive Multi-Armed Bandit

Learning preferences implicit in the choices humans make is a well studi...

Lawrence Chan, et al.

research

∙ 01/04/2019

On the Utility of Model Learning in HRI

Fundamental to robotics is the debate between model-based and model-free...

Rohan Choudhury*, et al.

research

∙ 05/31/2018

Learning a Prior over Intent via Meta-Inverse Reinforcement Learning

A significant challenge for the practical application of reinforcement l...

Kelvin Xu, et al.

research

∙ 02/06/2018

Shared Autonomy via Deep Reinforcement Learning

In shared autonomy, user input is combined with semi-autonomous control ...

Siddharth Reddy, et al.

research

∙ 11/08/2017

Inverse Reward Design

Autonomous agents optimize the reward function we give them. What they d...

Dylan Hadfield-Menell, et al.

research

∙ 05/28/2017

Should Robots be Obedient?

Intuitively, obedience -- following the order that a human gives -- seem...

Smitha Milli, et al.

research

∙ 04/23/2017

Translating Neuralese

Several approaches have recently been proposed for learning decentralize...

Jacob Andreas, et al.

research

∙ 11/24/2016

The Off-Switch Game

It is clear that one of the primary tools we can use to mitigate the pot...

Dylan Hadfield-Menell, et al.

research

∙ 06/09/2016

Cooperative Inverse Reinforcement Learning

For an autonomous system to be helpful to humans and to pose no unwarran...

Dylan Hadfield-Menell, et al.
