ENG: End-to-end Neural Geometry for Robust Depth and Pose Estimation using CNNs

07/16/2018
by Thanuja Dharmasiri, et al.

Recovering structure and motion parameters from an image pair or a sequence of images is a well-studied problem in computer vision. This is often achieved by employing Structure from Motion (SfM) or Simultaneous Localization and Mapping (SLAM) algorithms, depending on the real-time requirements. Recently, with the advent of Convolutional Neural Networks (CNNs), researchers have explored the possibility of using machine learning techniques to reconstruct the 3D structure of a scene and jointly predict the camera pose. In this work, we present a framework that achieves state-of-the-art performance on single image depth prediction for both indoor and outdoor scenes. The depth prediction system is then extended to predict optical flow and ultimately the camera pose, and is trained end-to-end. Our motion estimation framework outperforms previous motion prediction systems, and we also demonstrate that the state-of-the-art metric depths can be further improved using knowledge of the pose.
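
The abstract ties together depth, optical flow, and camera pose. As a rough illustration of why these three quantities constrain each other, the sketch below computes the optical flow that a known depth map and a known relative pose would induce in a static scene under a pinhole camera model. This is standard two-view geometry, not the network or training procedure of the paper; the function name `rigid_flow`, the intrinsics `K`, and the toy values are all assumptions made for the example.

```python
import numpy as np

def rigid_flow(depth, K, R, t):
    """Flow induced by camera motion (R, t) over a static scene.

    depth: (H, W) metric depth of the first view
    K:     (3, 3) pinhole intrinsics (assumed known)
    R, t:  rotation (3, 3) and translation (3,) mapping points from
           the first camera frame into the second camera frame
    Returns an (H, W, 2) array of pixel displacements (du, dv).
    """
    h, w = depth.shape
    u, v = np.meshgrid(np.arange(w, dtype=np.float64),
                       np.arange(h, dtype=np.float64))
    pix = np.stack([u, v, np.ones_like(u)], axis=-1)  # homogeneous pixels
    rays = pix @ np.linalg.inv(K).T                   # back-project: K^-1 p
    pts1 = rays * depth[..., None]                    # 3D points, frame 1
    pts2 = pts1 @ R.T + t                             # 3D points, frame 2
    proj = pts2 @ K.T                                 # re-project with K
    uv2 = proj[..., :2] / proj[..., 2:3]              # perspective divide
    return uv2 - pix[..., :2]

# Toy setup (hypothetical values): a fronto-parallel plane 2 m away,
# with the points shifted 0.1 m along x and no rotation.
K = np.array([[100.0, 0.0, 32.0],
              [0.0, 100.0, 24.0],
              [0.0, 0.0, 1.0]])
depth = np.full((48, 64), 2.0)
flow = rigid_flow(depth, K, np.eye(3), np.array([0.1, 0.0, 0.0]))
print(flow[0, 0])  # horizontal shift of fx * tx / depth = 5 px, no vertical shift
```

In this toy case the flow is uniform because the depth is constant; with varying depth, nearer pixels move more than distant ones, which is the cue that lets pose and flow carry information about metric depth.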
