Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition

05/18/2020
by Di Hu, et al.

Aerial scene recognition is a fundamental task in remote sensing and has recently received increased interest. Visual information from overhead images, combined with powerful models and efficient algorithms, yields considerable performance on scene recognition, but it still suffers from variation in ground objects, lighting conditions, etc. Inspired by the multi-channel perception theory in cognitive science, in this paper we explore a novel audiovisual aerial scene recognition task that uses both images and sounds as input in order to improve recognition performance. Based on the observation that some specific sound events are more likely to be heard at a given geographic location, we propose to exploit knowledge from sound events to improve performance on aerial scene recognition. For this purpose, we have constructed a new dataset named AuDio Visual Aerial sceNe reCognition datasEt (ADVANCE). With the help of this dataset, we evaluate three proposed approaches for transferring sound event knowledge to the aerial scene recognition task in a multimodal learning framework, and show the benefit of exploiting audio information for aerial scene recognition. The source code is publicly available for reproducibility purposes.

research
05/18/2020

Cross-Task Transfer for Multimodal Aerial Scene Recognition

Aerial scene recognition is a fundamental task in remote sensing and has...
research
08/18/2016

AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene Classification

Aerial scene classification, which aims to automatically label an aerial...
research
04/22/2021

Aerial Scene Understanding in The Wild: Multi-Scene Recognition via Prototype-based Memory Networks

Aerial scene recognition is a fundamental visual task and has attracted ...
research
08/30/2023

AGS: An Dataset and Taxonomy for Domestic Scene Sound Event Recognition

Environmental sound scene and sound event recognition is important for t...
research
04/07/2021

MultiScene: A Large-scale Dataset and Benchmark for Multi-scene Recognition in Single Aerial Images

Aerial scene recognition is a fundamental research problem in interpreti...
research
01/30/2020

ERA: A Dataset and Deep Learning Benchmark for Event Recognition in Aerial Videos

Along with the increasing use of unmanned aerial vehicles (UAVs), large ...
research
09/09/2022

Prediction method of Soundscape Impressions using Environmental Sounds and Aerial Photographs

We investigate a method for quantifying city characteristics based on i...
