Transformer-based model for monocular visual odometry: a video understanding approach

05/10/2023
by André O. Françani, et al.

Estimating camera pose from the images of a single camera is a long-standing task in mobile robotics and autonomous vehicles. This problem, known as monocular visual odometry, is often solved with geometric approaches that require engineering effort tailored to a specific scenario. Deep learning methods have been shown to generalize when properly trained on a sufficiently large amount of data. Transformer-based architectures have dominated the state of the art in natural language processing and in computer vision tasks such as image and video understanding. In this work, we treat monocular visual odometry as a video understanding task to estimate the camera's 6-DoF pose. We contribute the TSformer-VO model, which uses spatio-temporal self-attention mechanisms to extract features from clips and estimate the motions in an end-to-end manner. Our approach achieves performance competitive with state-of-the-art geometry-based and deep learning-based methods on the KITTI visual odometry dataset, and outperforms the DeepVO implementation that is widely accepted in the visual odometry community.
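To make the idea of spatio-temporal self-attention over a clip concrete, the following is a minimal sketch and not the TSformer-VO architecture itself. It assumes a grayscale clip given as nested lists, a single attention layer applied jointly over all space-time patch tokens, mean pooling, and a linear head that outputs six values per consecutive frame pair. All function names (`clip_to_tokens`, `self_attention`, `estimate_motions`) and the random, untrained weights are hypothetical and serve only to show the data flow.

```python
import math
import random


def clip_to_tokens(clip, patch):
    """Split every frame of a clip (frames x H x W) into flattened patches."""
    tokens = []
    for frame in clip:
        for r in range(0, len(frame), patch):
            for c in range(0, len(frame[0]), patch):
                tokens.append([frame[r + i][c + j]
                               for i in range(patch) for j in range(patch)])
    return tokens


def matmul(a, b):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def softmax(row):
    """Numerically stable softmax of one list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens, wq, wk, wv):
    """Scaled dot-product self-attention over all space-time tokens."""
    q, k, v = matmul(tokens, wq), matmul(tokens, wk), matmul(tokens, wv)
    scale = math.sqrt(len(q[0]))
    scores = matmul(q, [list(col) for col in zip(*k)])  # q times k transposed
    weights = [softmax([s / scale for s in row]) for row in scores]
    return matmul(weights, v), weights


def estimate_motions(clip, patch, seed=0):
    """Map a clip to one 6-value motion per consecutive frame pair.

    Assumption: weights are random and untrained, so the outputs are
    meaningless numbers; only the shapes and data flow are illustrative.
    """
    rng = random.Random(seed)

    def rand(rows, cols):
        return [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                for _ in range(rows)]

    tokens = clip_to_tokens(clip, patch)
    dim = len(tokens[0])
    features, weights = self_attention(
        tokens, rand(dim, dim), rand(dim, dim), rand(dim, dim))
    # Mean-pool the attended tokens into a single clip descriptor.
    pooled = [sum(col) / len(features) for col in zip(*features)]
    n_motions = len(clip) - 1
    flat = matmul([pooled], rand(dim, 6 * n_motions))[0]
    return [flat[6 * i:6 * i + 6] for i in range(n_motions)], weights
```

In this sketch a clip of three 4x4 frames with 2x2 patches yields 12 tokens, and the head returns two 6-value vectors, one per frame-to-frame motion. A trained model would learn these weights from data such as KITTI rather than sampling them at random.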

research
10/04/2022

Dense Prediction Transformer for Scale Estimation in Monocular Visual Odometry

Monocular visual odometry consists of the estimation of the position of ...
research
07/07/2021

RAM-VO: Less is more in Visual Odometry

Building vehicles capable of operating without human supervision require...
research
12/23/2021

MDN-VO: Estimating Visual Odometry with Confidence

Visual Odometry (VO) is used in many applications including robotics and...
research
07/27/2020

WGANVO: Monocular Visual Odometry based on Generative Adversarial Networks

In this work we present WGANVO, a Deep Learning based monocular Visual O...
research
04/03/2019

Beyond Tracking: Selecting Memory and Refining Poses for Deep Visual Odometry

Most previous learning-based visual odometry (VO) methods take VO as a p...
research
09/12/2021

Towards Robust Monocular Visual Odometry for Flying Robots on Planetary Missions

In the future, extraterrestrial expeditions will not only be conducted b...
research
03/25/2019

Learning Monocular Visual Odometry through Geometry-Aware Curriculum Learning

Inspired by the cognitive process of humans and animals, Curriculum Lear...
