The Sandbox Environment for Generalizable Agent Research (SEGAR)

03/19/2022
by   R Devon Hjelm, et al.
0

A broad challenge of research on generalization for sequential decision-making tasks in interactive environments is designing benchmarks that clearly landmark progress. While there has been notable headway, current benchmarks either do not provide suitable exposure nor intuitive control of the underlying factors, are not easy-to-implement, customizable, or extensible, or are computationally expensive to run. We built the Sandbox Environment for Generalizable Agent Research (SEGAR) with all of these things in mind. SEGAR improves the ease and accountability of generalization research in RL, as generalization objectives can be easy designed by specifying task distributions, which in turns allows the researcher to measure the nature of the generalization objective. We present an overview of SEGAR and how it contributes to these goals, as well as experiments that demonstrate a few types of research questions SEGAR can help answer.

READ FULL TEXT

page 8

page 9

research
08/20/2021

Explainable Reinforcement Learning for Broad-XAI: A Conceptual Framework and Survey

Broad Explainable Artificial Intelligence moves away from interpreting i...
research
06/04/2021

Be Considerate: Objectives, Side Effects, and Deciding How to Act

Recent work in AI safety has highlighted that in sequential decision mak...
research
02/14/2022

Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization

In the sequential decision making setting, an agent aims to achieve syst...
research
02/04/2019

Obstacle Tower: A Generalization Challenge in Vision, Control, and Planning

The rapid pace of research in Deep Reinforcement Learning has been drive...
research
03/15/2022

Zipfian environments for Reinforcement Learning

As humans and animals learn in the natural world, they encounter distrib...
research
02/06/2023

Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning

Recent works successfully leveraged Large Language Models' (LLM) abiliti...

Please sign up or login with your details

Forgot password? Click here to reset