Self-Supervised Object Detection via Generative Image Synthesis

We present SSOD, the first end-to-end analysis-by-synthesis framework with controllable GANs for the task of self-supervised object detection. We use collections of real-world images without bounding box annotations to learn to synthesize and detect objects. We leverage controllable GANs to synthesize images with pre-defined object properties and use them to train object detectors. We propose a tight end-to-end coupling of the synthesis and detection networks to optimally train our system. Finally, we also propose a method to optimally adapt SSOD to an intended target dataset without requiring labels for it. For the task of car detection, on the challenging KITTI and Cityscapes datasets, we show that SSOD outperforms the prior state-of-the-art purely image-based self-supervised object detection method, Wetectron. Even without requiring any 3D CAD assets, it also surpasses the state-of-the-art rendering-based method Meta-Sim2. Our work advances the field of self-supervised object detection by introducing a successful new paradigm of using controllable GAN-based image synthesis for the task and by significantly improving its baseline accuracy. We open-source our code at https://github.com/NVlabs/SSOD.
