Fully Differentiable and Interpretable Model for VIO with 4 Trainable Parameters

09/25/2021
by   Zexi Chen, et al.
0

Monocular visual-inertial odometry (VIO) is a critical problem in robotics and autonomous driving. Traditional methods solve this problem based on filtering or optimization. While being fully interpretable, they rely on manual interference and empirical parameter tuning. On the other hand, learning-based approaches allow for end-to-end training but require a large number of training data to learn millions of parameters. However, the non-interpretable and heavy models hinder the generalization ability. In this paper, we propose a fully differentiable, interpretable, and lightweight monocular VIO model that contains only 4 trainable parameters. Specifically, we first adopt Unscented Kalman Filter as a differentiable layer to predict the pitch and roll, where the covariance matrices of noise are learned to filter out the noise of the IMU raw data. Second, the refined pitch and roll are adopted to retrieve a gravity-aligned BEV image of each frame using differentiable camera projection. Finally, a differentiable pose estimator is utilized to estimate the remaining 4 DoF poses between the BEV frames. Our method allows for learning the covariance matrices end-to-end supervised by the pose estimation loss, demonstrating superior performance to empirical baselines. Experimental results on synthetic and real-world datasets demonstrate that our simple approach is competitive with state-of-the-art methods and generalizes well on unseen scenes.

READ FULL TEXT

page 1

page 6

research
03/14/2022

A Self-Supervised, Differentiable Kalman Filter for Uncertainty-Aware Visual-Inertial Odometry

Traditionally, visual-inertial-odometry (VIO) systems rely on filtering ...
research
04/18/2023

A Hyper-network Based End-to-end Visual Servoing with Arbitrary Desired Poses

Recently, several works achieve end-to-end visual servoing (VS) for robo...
research
10/09/2018

Functionally Modular and Interpretable Temporal Filtering for Robust Segmentation

The performance of autonomous systems heavily relies on their ability to...
research
05/12/2022

Efficient Deep Visual and Inertial Odometry with Adaptive Visual Modality Selection

In recent years, deep learning-based approaches for visual-inertial odom...
research
05/02/2023

EasyHeC: Accurate and Automatic Hand-eye Calibration via Differentiable Rendering and Space Exploration

Hand-eye calibration is a critical task in robotics, as it directly affe...
research
05/31/2022

Mask2Hand: Learning to Predict the 3D Hand Pose and Shape from Shadow

We present a self-trainable method, Mask2Hand, which learns to solve the...
research
03/25/2019

Learning Monocular Visual Odometry through Geometry-Aware Curriculum Learning

Inspired by the cognitive process of humans and animals, Curriculum Lear...

Please sign up or login with your details

Forgot password? Click here to reset