Encoder-Decoder Based Convolutional Neural Networks with Multi-Scale-Aware Modules for Crowd Counting

03/12/2020
by   Pongpisit Thanasutives, et al.
0

In this paper, we proposed two modified neural network architectures based on SFANet and SegNet respectively for accurate and efficient crowd counting. Inspired by SFANet, the first model is attached with two novel multi-scale-aware modules called, ASSP and CAN. This model is called M-SFANet. The encoder of M-SFANet is enhanced with ASSP containing parallel atrous convolution with different sampling rates and hence able to extract multi-scale features of the target object and incorporate larger context. To further deal with scale variation throughout an input image, we leverage contextual module called CAN which adaptively encodes the scales of the contextual information. The combination yields an effective model for counting in both dense and sparse crowd scenes. Based on SFANet decoder structure, M-SFANet decoder has dual paths, for density map generation and attention map generation. The second model is called M-SegNet. For M-SegNet, we simply change bilinear upsampling used in SFANet to max unpooling originally from SegNet and propose the faster model while providing competitive counting performance. Designed for high-speed surveillance applications, M-SegNet has no additional multi-scale-aware module in order to not increase the complexity. Both models are encoder-decoder based architectures and end-to-end trainable. We also conduct extensive experiments on four crowd counting datasets and one vehicle counting dataset to show that these modifications yield algorithms that could outperform some of state-of-the-art crowd counting methods.

READ FULL TEXT

page 10

page 12

page 13

research
04/06/2021

Multi-Scale Context Aggregation Network with Attention-Guided for Crowd Counting

Crowd counting aims to predict the number of people and generate the den...
research
05/24/2021

Multi-Level Attentive Convoluntional Neural Network for Crowd Counting

Recently the crowd counting has received more and more attention. Especi...
research
11/29/2018

ADCrowdNet: An Attention-injective Deformable Convolutional Network for Crowd Understanding

We propose an attention-injective deformable convolutional network calle...
research
02/29/2020

NAS-Count: Counting-by-Density with Neural Architecture Search

Most of the recent advances in crowd counting have evolved from hand-des...
research
11/26/2018

Context-Aware Crowd Counting

State-of-the-art methods for counting people in crowded scenes rely on d...
research
09/09/2019

Crowd Counting on Images with Scale Variation and Isolated Clusters

Crowd counting is to estimate the number of objects (e.g., people or veh...
research
04/17/2019

DENet: A Universal Network for Counting Crowd with Varying Densities and Scales

Counting people or objects with significantly varying scales and densiti...

Please sign up or login with your details

Forgot password? Click here to reset