MobileDepth: Efficient Monocular Depth Prediction on Mobile Devices

11/20/2020
by   Yekai Wang, et al.

Depth prediction is fundamental to many applications in computer vision and robotics. On mobile phones, accurate depth prediction can improve applications such as augmented reality and autofocus. This work proposes an efficient fully convolutional network architecture for depth prediction that uses RegNetY 06 as the encoder and split-concatenate shuffle blocks as the decoder. It also provides a combination of data augmentation, hyper-parameters and loss functions suited to training the lightweight network efficiently. In addition, an Android application has been developed that loads CNN models, predicts depth maps from monocular images captured by the mobile camera, and evaluates the average latency and frames per second of the models. As a result, the network achieves 82.7 [...] and, at the same time, has only 62 ms latency on ARM A76 CPUs, so that it can predict the depth map from the mobile camera in real time.
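The abstract reports both an average latency and a frames-per-second figure. The sketch below shows one way the two could be measured and related, assuming frames are processed one at a time with no pipelining. The names `benchmark`, `latency_to_fps`, `predict` and `warmup` are hypothetical and are not taken from the paper or its Android application.

```python
import time
from statistics import mean


def latency_to_fps(avg_latency_ms):
    """Convert an average per-frame latency in milliseconds to frames per second.

    Assumes sequential, non-overlapping inference (an assumption of this sketch).
    """
    if avg_latency_ms <= 0:
        raise ValueError("average latency must be positive")
    return 1000.0 / avg_latency_ms


def benchmark(predict, frames, warmup=2):
    """Time a hypothetical predict(frame) callable over a list of frames.

    Returns (average latency in ms, frames per second).
    """
    # Warm-up calls are executed but not timed.
    for frame in frames[:warmup]:
        predict(frame)

    latencies_ms = []
    for frame in frames:
        start = time.perf_counter()
        predict(frame)
        latencies_ms.append((time.perf_counter() - start) * 1000.0)

    avg_ms = mean(latencies_ms)
    return avg_ms, latency_to_fps(avg_ms)
```

Under the same assumption, `latency_to_fps(62.0)` gives roughly 16 frames per second for the 62 ms latency quoted above.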

research
07/23/2018

MVDepthNet: Real-time Multiview Depth Estimation Neural Network

Although deep neural networks have been widely applied to computer visio...
research
09/12/2018

End-to-end depth from motion with stabilized monocular videos

We propose a depth map inference system from monocular videos based on a...
research
05/25/2021

Real-time Monocular Depth Estimation with Sparse Supervision on Mobile

Monocular (relative or metric) depth estimation is a critical task for v...
research
09/02/2022

LiteDepth: Digging into Fast and Accurate Depth Estimation on Mobile Devices

Monocular depth estimation is an essential task in the computer vision c...
research
09/21/2018

Dynamic Environment Mapping for Augmented Reality Applications on Mobile Devices

Augmented Reality is a topic of foremost interest nowadays. Its main goa...
research
08/27/2020

One Shot 3D Photography

3D photography is a new medium that allows viewers to more fully experie...
research
07/24/2018

CReaM: Condensed Real-time Models for Depth Prediction using Convolutional Neural Networks

Since the resurgence of CNNs the robotic vision community has developed ...
