Bridging the Domain Gap for Ground-to-Aerial Image Matching

by   Krishna Regmi, et al.

The visual entities in cross-view images exhibit drastic domain changes due to the difference in viewpoints each set of images is captured from. Existing state-of-the-art methods address the problem by learning view-invariant descriptors for the images. We propose a novel method for solving this task by exploiting the generative powers of conditional GANs to synthesize an aerial representation of a ground level panorama and use it to minimize the domain gap between the two views. The synthesized image being from the same view as the target image helps the network to preserve important cues in aerial images following our Joint Feature Learning approach. Our Feature Fusion method combines the complementary features from a synthesized aerial image with the corresponding ground features to obtain a robust query representation. In addition, multi-scale feature aggregation preserves image representations at different feature scales useful for solving this complex task. Experimental results show that our proposed approach performs significantly better than the state-of-the-art methods on the challenging CVUSA dataset in terms of top-1 and top-1 method on urban landscapes, we collected a new cross-view localization dataset with geo-reference information.


page 1

page 3

page 6

page 7

page 8

page 12

page 13

page 14


Wide-Area Image Geolocalization with Aerial Reference Imagery

We propose to use deep convolutional neural networks to address the prob...

Leveraging Photogrammetric Mesh Models for Aerial-Ground Feature Point Matching Toward Integrated 3D Reconstruction

Integration of aerial and ground images has been proved as an efficient ...

Mutual Generative Transformer Learning for Cross-view Geo-localization

Cross-view geo-localization (CVGL), which aims to estimate the geographi...

GeoCapsNet: Aerial to Ground view Image Geo-localization using Capsule Network

The task of cross-view image geo-localization aims to determine the geo-...

Optimal Feature Transport for Cross-View Image Geo-Localization

This paper addresses the problem of cross-view image based localization,...

Hierarchically Learned View-Invariant Representations for Cross-View Action Recognition

Recognizing human actions from varied views is challenging due to huge a...

Retrieval-based Localization Based on Domain-invariant Feature Learning under Changing Environments

Visual localization is a crucial problem in mobile robotics and autonomo...

Please sign up or login with your details

Forgot password? Click here to reset