Multi-Plane Program Induction with 3D Box Priors

11/19/2020
by Yikai Li, et al.
We consider two important aspects in understanding and editing images: modeling regular, program-like texture or patterns in 2D planes, and 3D posing of these planes in the scene. Unlike prior work on image-based program synthesis, which assumes the image contains a single visible 2D plane, we present Box Program Induction (BPI), which infers a program-like scene representation that simultaneously models repeated structure on multiple 2D planes, the 3D position and orientation of the planes, and camera parameters, all from a single image. Our model assumes a box prior, i.e., that the image captures either an inner view or an outer view of a box in 3D. It uses neural networks to infer visual cues such as vanishing points and wireframe lines, which guide a search-based algorithm to find the program that best explains the image. Such a holistic, structured scene representation enables 3D-aware interactive image editing operations such as inpainting missing pixels, changing camera parameters, and extrapolating the image contents.
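As a rough illustration of how vanishing points can constrain camera parameters under a box prior, the sketch below uses a standard projective-geometry fact: the vanishing points of two orthogonal 3D directions, such as two edge directions of a box, determine the focal length once the principal point is known. This is not taken from the paper and is not claimed to be how BPI estimates its camera; the function names, the pinhole model with square pixels and zero skew, and the assumed principal point are all illustrative assumptions.

```python
import math


def vanishing_point(direction, focal, principal_point):
    """Project a 3D direction (camera coordinates) to its vanishing point.

    Assumes a pinhole camera with square pixels and zero skew.
    """
    dx, dy, dz = direction
    if abs(dz) < 1e-12:
        raise ValueError("direction is parallel to the image plane")
    cx, cy = principal_point
    return (focal * dx / dz + cx, focal * dy / dz + cy)


def focal_from_orthogonal_vps(vp1, vp2, principal_point):
    """Recover the focal length (in pixels) from two vanishing points.

    For orthogonal 3D directions, (vp1 - p) . (vp2 - p) + f^2 = 0,
    where p is the principal point (assumed known here).
    """
    cx, cy = principal_point
    dot = (vp1[0] - cx) * (vp2[0] - cx) + (vp1[1] - cy) * (vp2[1] - cy)
    if dot >= 0:
        raise ValueError("vanishing points are inconsistent with orthogonal directions")
    return math.sqrt(-dot)


# Hypothetical example: two orthogonal box edge directions seen by a
# camera with focal length 500 px and principal point (320, 240).
principal = (320.0, 240.0)
vp_a = vanishing_point((1.0, 0.0, 1.0), 500.0, principal)
vp_b = vanishing_point((-1.0, 0.0, 1.0), 500.0, principal)
estimated_focal = focal_from_orthogonal_vps(vp_a, vp_b, principal)
```

In this synthetic setup the two directions are orthogonal by construction, so the estimate matches the focal length used to generate the vanishing points. With vanishing points predicted by a network, the orthogonality constraint would hold only approximately.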


