Evaluating Disentanglement of Structured Latent Representations

01/11/2021
by   Raphaël Dang-Nhu, et al.
8

We design the first multi-layer disentanglement metric operating at all hierarchy levels of a structured latent representation, and derive its theoretical properties. Applied to object-centric representations, our metric unifies the evaluation of both object separation between latent slots and internal slot disentanglement into a common mathematical framework. It also addresses the problematic dependence on segmentation mask sharpness of previous pixel-level segmentation metrics such as ARI. Perhaps surprisingly, our experimental results show that good ARI values do not guarantee a disentangled representation, and that the exclusive focus on this metric has led to counterproductive choices in some previous evaluations. As an additional technical contribution, we present a new algorithm for obtaining feature importances that handles slot permutation invariance in the representation.

READ FULL TEXT

page 7

page 8

page 19

page 20

page 21

page 22

page 23

research
12/31/2020

Language-Mediated, Object-Centric Representation Learning

We present Language-mediated, Object-centric Representation Learning (LO...
research
11/02/2022

Neural Block-Slot Representations

In this paper, we propose a novel object-centric representation, called ...
research
11/26/2020

A Metric for Linear Symmetry-Based Disentanglement

The definition of Linear Symmetry-Based Disentanglement (LSBD) proposed ...
research
10/23/2020

Generative Neurosymbolic Machines

Reconciling symbolic and distributed representations is a crucial challe...
research
07/18/2020

Slot Contrastive Networks: A Contrastive Approach for Representing Objects

Unsupervised extraction of objects from low-level visual data is an impo...
research
12/16/2021

Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation

Video Panoptic Segmentation (VPS) aims at assigning a class label to eac...
research
04/04/2023

Divided Attention: Unsupervised Multi-Object Discovery with Contextually Separated Slots

We introduce a method to segment the visual field into independently mov...

Please sign up or login with your details

Forgot password? Click here to reset