SSPNet: Scale Selection Pyramid Network for Tiny Person Detection from UAV Images

by   Mingbo Hong, et al.

With the increasing demand for search and rescue, it is highly demanded to detect objects of interest in large-scale images captured by Unmanned Aerial Vehicles (UAVs), which is quite challenging due to extremely small scales of objects. Most existing methods employed Feature Pyramid Network (FPN) to enrich shallow layers' features by combing deep layers' contextual features. However, under the limitation of the inconsistency in gradient computation across different layers, the shallow layers in FPN are not fully exploited to detect tiny objects. In this paper, we propose a Scale Selection Pyramid network (SSPNet) for tiny person detection, which consists of three components: Context Attention Module (CAM), Scale Enhancement Module (SEM), and Scale Selection Module (SSM). CAM takes account of context information to produce hierarchical attention heatmaps. SEM highlights features of specific scales at different layers, leading the detector to focus on objects of specific scales instead of vast backgrounds. SSM exploits adjacent layers' relationships to fulfill suitable feature sharing between deep layers and shallow layers, thereby avoiding the inconsistency in gradient computation across different layers. Besides, we propose a Weighted Negative Sampling (WNS) strategy to guide the detector to select more representative samples. Experiments on the TinyPerson benchmark show that our method outperforms other state-of-the-art (SOTA) detectors.


page 1

page 3

page 5

page 6


YOLOv3 with Spatial Pyramid Pooling for Object Detection with Unmanned Aerial Vehicles

Object detection with Unmanned Aerial Vehicles (UAVs) has attracted much...

NETNet: Neighbor Erasing and Transferring Network for Better Single Shot Object Detection

Due to the advantages of real-time detection and improved performance, s...

Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection

Activity detection is a fundamental problem in computer vision. Detectin...

Shallow Feature Based Dense Attention Network for Crowd Counting

While the performance of crowd counting via deep learning has been impro...

SSSDET: Simple Short and Shallow Network for Resource Efficient Vehicle Detection in Aerial Scenes

Detection of small-sized targets is of paramount importance in many aeri...

Person Re-identification via Attention Pyramid

In this paper, we propose an attention pyramid method for person re-iden...

Detector With Focus: Normalizing Gradient In Image Pyramid

An image pyramid can extend many object detection algorithms to solve de...

Please sign up or login with your details

Forgot password? Click here to reset