Multi-scale Cross-form Pyramid Network for Stereo Matching

04/25/2019
by   Zhidong Zhu, et al.
12

Stereo matching plays an indispensable part in autonomous driving, robotics and 3D scene reconstruction. We propose a novel deep learning architecture, which called CFP-Net, a Cross-Form Pyramid stereo matching network for regressing disparity from a rectified pair of stereo images. The network consists of three modules: Multi-Scale 2D local feature extraction module, Cross-form spatial pyramid module and Multi-Scale 3D Feature Matching and Fusion module. The Multi-Scale 2D local feature extraction module can extract enough multi-scale features. The Cross-form spatial pyramid module aggregates the context information in different scales and locations to form a cost volume. Moreover, it is proved to be more effective than SPP and ASPP in ill-posed regions. The Multi-Scale 3D feature matching and fusion module is proved to regularize the cost volume using two parallel 3D deconvolution structure with two different receptive fields. Our proposed method has been evaluated on the Scene Flow and KITTI datasets. It achieves state-of-the-art performance on the KITTI 2012 and 2015 benchmarks.

READ FULL TEXT

page 3

page 4

page 5

page 6

research
04/25/2019

MSDC-Net: Multi-Scale Dense and Contextual Networks for Automated Disparity Map for Stereo Matching

Disparity prediction from stereo images is essential to computer vision ...
research
03/23/2018

Pyramid Stereo Matching Network

Recent work has shown that depth estimation from a stereo pair of images...
research
02/19/2021

Serial-parallel Multi-Scale Feature Fusion for Anatomy-Oriented Hand Joint Detection

Accurate hand joints detection from images is a fundamental topic which ...
research
03/05/2021

ES-Net: An Efficient Stereo Matching Network

Dense stereo matching with deep neural networks is of great interest to ...
research
08/06/2023

Multi-scale Alternated Attention Transformer for Generalized Stereo Matching

Recent stereo matching networks achieves dramatic performance by introdu...
research
04/03/2019

StereoDRNet: Dilated Residual Stereo Net

We propose a system that uses a convolution neural network (CNN) to esti...
research
04/17/2019

CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing

Objects in an image exhibit diverse scales. Adaptive receptive fields ar...

Please sign up or login with your details

Forgot password? Click here to reset