PanoContext-Former: Panoramic Total Scene Understanding with a Transformer

05/21/2023
by Yuan Dong, et al.

Panoramic images enable a deeper understanding and more holistic perception of the 360° surrounding environment, and naturally encode richer scene context than standard perspective images. Previous work has largely approached scene understanding in a bottom-up fashion, processing each sub-task separately and exploring few correlations between them. In this paper, we propose a novel method that uses a depth prior for holistic indoor scene understanding, recovering object shapes, oriented bounding boxes, and the 3D room layout simultaneously from a single panorama. To fully exploit the rich context information, we design a transformer-based context module that predicts the representation of, and the relationships among, the components of the scene. In addition, we introduce a real-world dataset for scene understanding that includes photo-realistic panoramas, high-fidelity depth images, accurately annotated room layouts, and oriented object bounding boxes and shapes. Experiments on synthetic and real-world datasets demonstrate that our method outperforms previous panoramic scene understanding methods in both layout estimation and 3D object detection.
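The abstract does not specify how the context module is built. As a rough illustration only, the sketch below shows generic single-head scaled dot-product self-attention over a set of scene tokens, which is the basic mechanism a transformer uses to relate components to one another. The token layout (four object tokens plus one room-layout token), the dimension of 8, the random weights, and all function names are assumptions made for this example and are not taken from the paper.

```python
import math
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def softmax(row):
    """Numerically stable softmax over one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention.

    Returns the context-refined tokens and the attention weights,
    where weights[i][j] is how much token i attends to token j.
    """
    q, k, v = matmul(tokens, w_q), matmul(tokens, w_k), matmul(tokens, w_v)
    scale = math.sqrt(len(k[0]))
    k_t = [list(col) for col in zip(*k)]  # transpose of k
    scores = matmul(q, k_t)
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


def random_matrix(rows, cols, rng):
    """Stand-in for learned features or weights (hypothetical values)."""
    return [[rng.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


rng = random.Random(0)
NUM_OBJECTS, DIM = 4, 8  # assumed sizes, for illustration only

# Hypothetical scene tokens: one per detected object plus one for the layout.
object_tokens = random_matrix(NUM_OBJECTS, DIM, rng)
layout_token = random_matrix(1, DIM, rng)
tokens = object_tokens + layout_token

w_q, w_k, w_v = (random_matrix(DIM, DIM, rng) for _ in range(3))
refined, weights = self_attention(tokens, w_q, w_k, w_v)
```

Each row of `weights` sums to 1, so every refined token is a weighted mixture of all scene components, objects and layout alike. A real model would stack several such layers with learned weights and task-specific output heads.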


