SpaceNet MVOI: a Multi-View Overhead Imagery Dataset

by   Nicholas Weir, et al.
In-Q-Tel, Inc.

Detection and segmentation of objects in overheard imagery is a challenging task. The variable density, random orientation, small size, and instance-to-instance heterogeneity of objects in overhead imagery calls for approaches distinct from existing models designed for natural scene datasets. Though new overhead imagery datasets are being developed, they almost universally comprise a single view taken from directly overhead ("at nadir"), failing to address one critical variable: look angle. By contrast, views vary in real-world overhead imagery, particularly in dynamic scenarios such as natural disasters where first looks are often over 40 degrees off-nadir. This represents an important challenge to computer vision methods, as changing view angle adds distortions, alters resolution, and changes lighting. At present, the impact of these perturbations for algorithmic detection and segmentation of objects is untested. To address this problem, we introduce the SpaceNet Multi-View Overhead Imagery (MVOI) Dataset, an extension of the SpaceNet open source remote sensing dataset. MVOI comprises 27 unique looks from a broad range of viewing angles (-32 to 54 degrees). Each of these images cover the same geography and are annotated with 126,747 building footprint labels, enabling direct assessment of the impact of viewpoint perturbation on model performance. We benchmark multiple leading segmentation and object detection models on: (1) building detection, (2) generalization to unseen viewing angles and resolutions, and (3) sensitivity of building footprint extraction to changes in resolution. We find that segmentation and object detection models struggle to identify buildings in off-nadir imagery and generalize poorly to unseen views, presenting an important benchmark to explore the broadly relevant challenge of detecting small, heterogeneous target objects in visually dynamic contexts.


page 3

page 4

page 7


xView: Objects in Context in Overhead Imagery

We introduce a new large-scale dataset for the advancement of object det...

Towards seamless multi-view scene analysis from satellite to street-level

In this paper, we discuss and review how combined multi-view imagery fro...

Improving Building Segmentation for Off-Nadir Satellite Imagery

Automatic building segmentation is an important task for satellite image...

The Multi-Temporal Urban Development SpaceNet Dataset

Satellite imagery analytics have numerous human development and disaster...

Road Network and Travel Time Extraction from Multiple Look Angles with SpaceNet Data

Identification of road networks and optimal routes directly from remote ...

Segment anything, from space?

Recently, the first foundation model developed specifically for vision t...

Evolving Evocative 2D Views of Generated 3D Objects

We present a method for jointly generating 3D models of objects and 2D r...

Please sign up or login with your details

Forgot password? Click here to reset