Pixel-wise Crowd Understanding via Synthetic Data

07/30/2020
by   Wang Qi, et al.
0

Crowd analysis via computer vision techniques is an important topic in the field of video surveillance, which has wide-spread applications including crowd monitoring, public safety, space design and so on. Pixel-wise crowd understanding is the most fundamental task in crowd analysis because of its finer results for video sequences or still images than other analysis tasks. Unfortunately, pixel-level understanding needs a large amount of labeled training data. Annotating them is an expensive work, which causes that current crowd datasets are small. As a result, most algorithms suffer from over-fitting to varying degrees. In this paper, take crowd counting and segmentation as examples from the pixel-wise crowd understanding, we attempt to remedy these problems from two aspects, namely data and methodology. Firstly, we develop a free data collector and labeler to generate synthetic and labeled crowd scenes in a computer game, Grand Theft Auto V. Then we use it to construct a large-scale, diverse synthetic crowd dataset, which is named as "GCC Dataset". Secondly, we propose two simple methods to improve the performance of crowd understanding via exploiting the synthetic data. To be specific, 1) supervised crowd understanding: pre-train a crowd analysis model on the synthetic data, then fine-tune it using the real data and labels, which makes the model perform better on the real world; 2) crowd understanding via domain adaptation: translate the synthetic data to photo-realistic images, then train the model on translated data and labels. As a result, the trained model works well in real crowd scenes.

READ FULL TEXT

page 2

page 5

page 7

page 9

page 13

page 16

page 17

research
03/08/2019

Learning from Synthetic Data for Crowd Counting in the Wild

Recently, counting the number of people for crowd scenes is a hot topic ...
research
12/08/2019

Feature-aware Adaptation and Structured Density Alignment for Crowd Counting in Video Surveillance

With the development of deep neural networks, the performance of crowd c...
research
09/29/2020

A Flow Base Bi-path Network for Cross-scene Video Crowd Understanding in Aerial View

Drones shooting can be applied in dynamic traffic monitoring, object det...
research
02/20/2020

Focus on Semantic Consistency for Cross-domain Crowd Understanding

For pixel-level crowd understanding, it is time-consuming and laborious ...
research
12/08/2019

Domain-adaptive Crowd Counting via Inter-domain Features Segregation and Gaussian-prior Reconstruction

Recently, crowd counting using supervised learning achieves a remarkable...
research
08/13/2022

UAV-CROWD: Violent and non-violent crowd activity simulator from the perspective of UAV

Unmanned Aerial Vehicle (UAV) has gained significant traction in the rec...
research
09/14/2021

Learning Bill Similarity with Annotated and Augmented Corpora of Bills

Bill writing is a critical element of representative democracy. However,...

Please sign up or login with your details

Forgot password? Click here to reset