Silviu Pitis

research

∙ 04/12/2023

Boosted Prompt Ensembles for Large Language Models

Methods such as chain-of-thought prompting and self-consistency have pus...

0 Silviu Pitis, et al. ∙

research

∙ 11/03/2022

Large Language Models Are Human-Level Prompt Engineers

By conditioning on natural language instructions, large language models ...

0 Yongchao Zhou, et al. ∙

research

∙ 10/20/2022

MoCoDA: Model-based Counterfactual Data Augmentation

The number of states in a dynamic process is exponential in the number o...

0 Silviu Pitis, et al. ∙

research

∙ 07/06/2020

Counterfactual Data Augmentation using Locally Factored Dynamics

Many dynamic processes, including common scenarios in robotic control an...

0 Silviu Pitis, et al. ∙

research

∙ 07/06/2020

Maximum Entropy Gain Exploration for Long Horizon Multi-goal Reinforcement Learning

What goals should a multi-goal reinforcement learning agent pursue durin...

0 Silviu Pitis, et al. ∙

research

∙ 02/14/2020

An Inductive Bias for Distances: Neural Nets that Respect the Triangle Inequality

Distances are pervasive in machine learning. They serve as similarity me...

9 Silviu Pitis, et al. ∙

research

∙ 01/27/2020

Objective Social Choice: Using Auxiliary Information to Improve Voting Outcomes

How should one combine noisy information from diverse sources to make an...

0 Silviu Pitis, et al. ∙

research

∙ 09/09/2019

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

We explore fixed-horizon temporal difference (TD) methods, reinforcement...

0 Kristopher De Asis, et al. ∙

research

∙ 02/08/2019

Source Traces for Temporal Difference Learning

This paper motivates and develops source traces for temporal difference ...

0 Silviu Pitis, et al. ∙

research

∙ 02/08/2019

Rethinking the Discount Factor in Reinforcement Learning: A Decision Theoretic Approach

Reinforcement learning (RL) agents have traditionally been tasked with m...

0 Silviu Pitis, et al. ∙

