Continuous Scene Representations for Embodied AI

03/31/2022
by   Samir Yitzhak Gadre, et al.

We propose Continuous Scene Representations (CSR), a scene representation constructed by an embodied agent navigating within a space, where objects and their relationships are modeled by continuous-valued embeddings. Our method captures feature relationships between objects, composes them into a graph structure on-the-fly, and situates an embodied agent within the representation. Our key insight is to embed pair-wise relationships between objects in a latent space. This allows for a richer representation compared to discrete relations (e.g., [support], [next-to]) commonly used for building scene representations. CSR can track objects as the agent moves in a scene, update the representation accordingly, and detect changes in room configurations. Using CSR, we outperform state-of-the-art approaches for the challenging downstream task of visual room rearrangement, without any task-specific training. Moreover, we show the learned embeddings capture salient spatial details of the scene and show applicability to real-world data. A summary video and code are available at https://prior.allenai.org/projects/csr.
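To make the idea in the abstract concrete, here is a minimal sketch of a graph whose nodes and edges both carry continuous embeddings. It is not the authors' implementation: the class `SceneGraph`, its methods, the cosine-similarity matching rule, and the threshold value are all assumptions chosen for illustration, and the hand-written vectors stand in for embeddings that CSR would learn from images.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


class SceneGraph:
    """Hypothetical graph with continuous embeddings on nodes and edges."""

    def __init__(self, match_threshold=0.9):
        self.nodes = {}  # node id -> object embedding
        self.edges = {}  # (id_a, id_b) -> pairwise relationship embedding
        self.match_threshold = match_threshold  # assumed value, not from the paper

    def match(self, embedding):
        """Return the id of the most similar stored object, or None if none is close enough."""
        best_id, best_sim = None, self.match_threshold
        for node_id, stored in self.nodes.items():
            sim = cosine(embedding, stored)
            if sim >= best_sim:
                best_id, best_sim = node_id, sim
        return best_id

    def observe(self, embedding):
        """Track an object: reuse a matching node or add a new one, then store the latest embedding."""
        node_id = self.match(embedding)
        if node_id is None:
            node_id = len(self.nodes)
        self.nodes[node_id] = embedding
        return node_id

    def relate(self, id_a, id_b, rel_embedding):
        """Store a relationship embedding; return its similarity to the previous one, if any."""
        previous = self.edges.get((id_a, id_b))
        self.edges[(id_a, id_b)] = rel_embedding
        return None if previous is None else cosine(previous, rel_embedding)


graph = SceneGraph()
mug = graph.observe([1.0, 0.0, 0.0])
table = graph.observe([0.0, 1.0, 0.0])
graph.relate(mug, table, [1.0, 0.0])          # first sighting of the pair
change = graph.relate(mug, table, [0.0, 1.0])  # a low similarity hints at a rearrangement
```

In this sketch, a low similarity returned by `relate` plays the role of a change signal between two visits to the same room; how CSR actually scores changes is described in the full paper.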


research
02/26/2019

Learning Latent Scene-Graph Representations for Referring Relationships

Understanding the semantics of complex visual scenes often requires anal...
research
10/19/2020

Language and Visual Entity Relationship Graph for Agent Navigation

Vision-and-Language Navigation (VLN) requires an agent to navigate in a ...
research
03/30/2021

Visual Room Rearrangement

There has been a significant recent progress in the field of Embodied AI...
research
07/12/2022

Language-Based Causal Representation Learning

Consider the finite state graph that results from a simple, discrete, dy...
research
07/21/2022

TIDEE: Tidying Up Novel Rooms using Visuo-Semantic Commonsense Priors

We introduce TIDEE, an embodied agent that tidies up a disordered scene ...
research
12/24/2014

Transformation Properties of Learned Visual Representations

When a three-dimensional object moves relative to an observer, a change ...
research
04/28/2023

SGAligner : 3D Scene Alignment with Scene Graphs

Building 3D scene graphs has recently emerged as a topic in scene repres...
