Depth Field Networks for Generalizable Multi-view Scene Representation

07/28/2022
by   Vitor Guizilini, et al.
6

Modern 3D computer vision leverages learning to boost geometric reasoning, mapping image data to classical structures such as cost volumes or epipolar constraints to improve matching. These architectures are specialized according to the particular problem, and thus require significant task-specific tuning, often leading to poor domain generalization performance. Recently, generalist Transformer architectures have achieved impressive results in tasks such as optical flow and depth estimation by encoding geometric priors as inputs rather than as enforced constraints. In this paper, we extend this idea and propose to learn an implicit, multi-view consistent scene representation, introducing a series of 3D data augmentation techniques as a geometric inductive prior to increase view diversity. We also show that introducing view synthesis as an auxiliary task further improves depth estimation. Our Depth Field Networks (DeFiNe) achieve state-of-the-art results in stereo and video depth estimation without explicit geometric constraints, and improve on zero-shot domain generalization by a wide margin.

READ FULL TEXT

page 11

page 14

page 17

page 19

research
10/21/2022

Context-Enhanced Stereo Transformer

Stereo depth estimation is of great interest for computer vision researc...
research
04/17/2019

Multi-Scale Geometric Consistency Guided Multi-View Stereo

In this paper, we propose an efficient multi-scale geometric consistency...
research
03/19/2020

Depth Estimation by Learning Triangulation and Densification of Sparse Points for Multi-view Stereo

Multi-view stereo (MVS) is the golden mean between the accuracy of activ...
research
12/06/2021

Input-level Inductive Biases for 3D Reconstruction

Much of the recent progress in 3D vision has been driven by the developm...
research
05/05/2022

Exploiting Correspondences with All-pairs Correlations for Multi-view Depth Estimation

Multi-view depth estimation plays a critical role in reconstructing and ...
research
06/24/2021

FaDIV-Syn: Fast Depth-Independent View Synthesis

We introduce FaDIV-Syn, a fast depth-independent view synthesis method. ...
research
04/13/2023

iDisc: Internal Discretization for Monocular Depth Estimation

Monocular depth estimation is fundamental for 3D scene understanding and...

Please sign up or login with your details

Forgot password? Click here to reset