HoHoNet: 360 Indoor Holistic Understanding with Latent Horizontal Features

11/23/2020
by Cheng Sun, et al.

We present HoHoNet, a versatile and efficient framework for holistic understanding of an indoor 360-degree panorama using a Latent Horizontal Feature (LHFeat). The compact LHFeat flattens the features along the vertical direction and has shown success in modeling per-column modalities for room layout reconstruction. HoHoNet advances in two important aspects. First, the deep architecture is redesigned to run faster with improved accuracy. Second, we propose a novel horizon-to-dense module, which relaxes the per-column output shape constraint, allowing per-pixel dense prediction from LHFeat. HoHoNet is fast: it runs at 52 FPS and 110 FPS with ResNet-50 and ResNet-34 backbones, respectively, for modeling dense modalities from a high-resolution 512 × 1024 panorama. HoHoNet is also accurate. On the tasks of layout estimation and semantic segmentation, HoHoNet achieves results on par with the current state of the art. On dense depth estimation, HoHoNet outperforms all prior art by a large margin.
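The two ideas in the abstract, flattening features along the vertical direction and then recovering a per-pixel map from the flattened feature, can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not say how the flattening or the horizon-to-dense module is computed, so the sketch assumes plain averaging over the height axis and a per-column linear expansion as stand-ins. The function names `flatten_vertical` and `horizon_to_dense` and the `weights` matrix are hypothetical.

```python
import random


def flatten_vertical(feat):
    """Collapse a C x H x W feature map over its height axis.

    Assumption: simple averaging stands in for the learned flattening.
    Returns a C x W "horizontal" feature (one vector per image column).
    """
    return [[sum(col) / len(col) for col in zip(*channel)] for channel in feat]


def horizon_to_dense(lhfeat, weights):
    """Expand a C x W horizontal feature into an H x W dense map.

    Assumption: a fixed H x C matrix `weights` maps each column's
    C-vector to H output values; the real module may differ.
    """
    width = len(lhfeat[0])
    return [
        [sum(w * lhfeat[c][x] for c, w in enumerate(row)) for x in range(width)]
        for row in weights
    ]


if __name__ == "__main__":
    random.seed(0)
    channels, height, width = 4, 8, 16  # toy sizes, far below 512 x 1024
    feat = [
        [[random.random() for _ in range(width)] for _ in range(height)]
        for _ in range(channels)
    ]
    weights = [[random.random() for _ in range(channels)] for _ in range(height)]

    lhfeat = flatten_vertical(feat)            # C x W
    dense = horizon_to_dense(lhfeat, weights)  # H x W
    print(len(lhfeat), len(lhfeat[0]))
    print(len(dense), len(dense[0]))
```

The point of the sketch is the shape change: the intermediate feature has no height axis, yet the final output is again one value per pixel, which is the constraint the abstract says the horizon-to-dense module relaxes.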


