CURI: A Benchmark for Productive Concept Learning Under Uncertainty

10/06/2020
by   Ramakrishna Vedantam, et al.
4

Humans can learn and reason under substantial uncertainty in a space of infinitely many concepts, including structured relational concepts ("a scene with objects that have the same color") and ad-hoc categories defined through goals ("objects that could fall on one's head"). In contrast, standard classification benchmarks: 1) consider only a fixed set of category labels, 2) do not evaluate compositional concept learning and 3) do not explicitly capture a notion of reasoning under uncertainty. We introduce a new few-shot, meta-learning benchmark, Compositional Reasoning Under Uncertainty (CURI) to bridge this gap. CURI evaluates different aspects of productive and systematic generalization, including abstract understandings of disentangling, productive generalization, learning boolean operations, variable binding, etc. Importantly, it also defines a model-independent "compositionality gap" to evaluate the difficulty of generalizing out-of-distribution along each of these axes. Extensive evaluations across a range of modeling choices spanning different modalities (image, schemas, and sounds), splits, privileged auxiliary concept information, and choices of negatives reveal substantial scope for modeling advances on the proposed task. All code and datasets will be available online.

READ FULL TEXT

page 11

page 12

page 13

page 14

page 17

research
05/27/2022

Bongard-HOI: Benchmarking Few-Shot Visual Reasoning for Human-Object Interactions

A significant gap remains between today's visual pattern recognition mod...
research
07/14/2020

Concept Learners for Generalizable Few-Shot Learning

Developing algorithms that are able to generalize to a novel task given ...
research
05/30/2023

Compositional diversity in visual concept learning

Humans leverage compositionality to efficiently learn new concepts, unde...
research
04/24/2022

RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning

Reasoning about visual relationships is central to how humans interpret ...
research
05/13/2021

Shades of confusion: Lexical uncertainty modulates ad hoc coordination in an interactive communication task

There is substantial variability in the expectations that communication ...
research
08/03/2021

Generalization in Multimodal Language Learning from Simulation

Neural networks can be powerful function approximators, which are able t...
research
06/15/2021

Contextualizing Multiple Tasks via Learning to Decompose

One single instance could possess multiple portraits and reveal diverse ...

Please sign up or login with your details

Forgot password? Click here to reset