Convolutional Cross-View Pose Estimation

by Zimin Xia, et al.

We propose a novel end-to-end method for cross-view pose estimation. Given a ground-level query image and an aerial image that covers the query's local neighborhood, the 3 Degrees-of-Freedom camera pose of the query is estimated by matching its image descriptor to descriptors of local regions within the aerial image. The orientation-aware descriptors are obtained by using a translationally equivariant convolutional ground image encoder and contrastive learning. The Localization Decoder produces a dense probability distribution in a coarse-to-fine manner with a novel Localization Matching Upsampling module. A smaller Orientation Decoder produces a vector field to condition the orientation estimate on the localization. Our method is validated on the VIGOR and KITTI datasets, where it surpasses the state-of-the-art baseline by 72% and 36% in median localization error for comparable orientation estimation accuracy. The predicted probability distribution can represent localization ambiguity and enables rejecting possibly erroneous predictions. Without re-training, the model can infer on ground images with different fields of view and utilize orientation priors if available. On the Oxford RobotCar dataset, our method can reliably estimate the ego-vehicle's pose over time, achieving a median localization error under 1 meter and a median orientation error of around 1 degree at 14 FPS.
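The matching step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes the ground descriptor and a grid of aerial region descriptors are already available as NumPy arrays, swaps in plain cosine similarity followed by a softmax for the learned Localization Decoder, and uses hypothetical names and parameters (`match_descriptors`, `localize`, `temperature`, `min_peak`) chosen only for illustration.

```python
import numpy as np


def match_descriptors(ground_desc, aerial_descs, temperature=0.1):
    """Turn descriptor similarities into an (H, W) probability map.

    ground_desc:  (D,) descriptor of the ground-level query (assumed given).
    aerial_descs: (H, W, D) descriptors of local aerial regions (assumed given).
    temperature:  hypothetical softmax temperature, not taken from the paper.
    """
    # L2-normalise so the dot product below is a cosine similarity.
    g = ground_desc / np.linalg.norm(ground_desc)
    a = aerial_descs / np.linalg.norm(aerial_descs, axis=-1, keepdims=True)
    sim = a @ g  # shape (H, W)

    # Numerically stable softmax over all grid cells.
    logits = sim / temperature
    logits = logits - logits.max()
    prob = np.exp(logits)
    return prob / prob.sum()


def localize(prob, min_peak=0.5):
    """Pick the most likely cell and flag low-confidence predictions.

    min_peak is a hypothetical rejection threshold on the peak probability.
    """
    row, col = np.unravel_index(np.argmax(prob), prob.shape)
    peak = float(prob[row, col])
    return (int(row), int(col)), peak, peak >= min_peak


# Toy data: the query descriptor is a noisy copy of the aerial cell (5, 2).
rng = np.random.default_rng(0)
aerial = rng.normal(size=(8, 8, 16))
ground = aerial[5, 2] + 0.05 * rng.normal(size=16)

prob = match_descriptors(ground, aerial)
cell, peak, accepted = localize(prob)
print(cell, round(peak, 3), accepted)
```

A flat or multi-modal `prob` map corresponds to the localization ambiguity mentioned in the abstract, and thresholding the peak is one simple way such a distribution could be used to reject a prediction.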




