Compositional Visual Generation and Inference with Energy Based Models

04/13/2020
by   Yilun Du, et al.
1

A vital aspect of human intelligence is the ability to compose increasingly complex concepts out of simpler ideas, enabling both rapid learning and adaptation of knowledge. In this paper we show that energy-based models can exhibit this ability by directly combining probability distributions. Samples from the combined distribution correspond to compositions of concepts. For example, given a distribution for smiling faces, and another for male faces, we can combine them to generate smiling male faces. This allows us to generate natural images that simultaneously satisfy conjunctions, disjunctions, and negations of concepts. We evaluate compositional generation abilities of our model on the CelebA dataset of natural faces and synthetic 3D scene images. We also demonstrate other unique advantages of our model, such as the ability to continually learn and incorporate new concepts, or infer compositions of concept properties underlying an image.

READ FULL TEXT

page 2

page 4

page 5

page 6

page 7

page 8

page 12

page 13

research
06/03/2022

Compositional Visual Generation with Composable Diffusion Models

Large text-guided diffusion models, such as DALLE-2, are able to generat...
research
11/04/2021

Unsupervised Learning of Compositional Energy Concepts

Humans are able to rapidly understand scenes by utilizing concepts extra...
research
06/07/2023

ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models

The ability to understand visual concepts and replicate and compose thes...
research
11/06/2018

Concept Learning with Energy-Based Models

Many hallmarks of human intelligence, such as generalizing from limited ...
research
05/29/2023

Vector-based Representation is the Key: A Study on Disentanglement and Compositional Generalization

Recognizing elementary underlying concepts from observations (disentangl...
research
04/27/2015

Simple Derivation of the Lifetime and the Distribution of Faces for a Binary Subdivision Model

The iterative random subdivision of rectangles is used as a generation m...
research
05/25/2016

Action Classification via Concepts and Attributes

Classes in natural images tend to follow long tail distributions. This i...

Please sign up or login with your details

Forgot password? Click here to reset