Learning a Layout Transfer Network for Context Aware Object Detection

12/09/2019
by   PetsTime, et al.
5

We present a context aware object detection method based on a retrieve-and-transform scene layout model. Given an input image, our approach first retrieves a coarse scene layout from a codebook of typical layout templates. In order to handle large layout variations, we use a variant of the spatial transformer network to transform and refine the retrieved layout, resulting in a set of interpretable and semantically meaningful feature maps of object locations and scales. The above steps are implemented as a Layout Transfer Network which we integrate into Faster RCNN to allow for joint reasoning of object detection and scene layout estimation. Extensive experiments on three public datasets verified that our approach provides consistent performance improvements to the state-of-the-art object detection baselines on a variety of challenging tasks in the traffic surveillance and the autonomous driving domains.

READ FULL TEXT

page 1

page 2

page 4

page 6

page 10

page 12

page 13

page 15

research
09/19/2022

A Dual-Cycled Cross-View Transformer Network for Unified Road Layout Estimation and 3D Object Detection in the Bird's-Eye-View

The bird's-eye-view (BEV) representation allows robust learning of multi...
research
03/11/2021

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a sin...
research
03/22/2021

Context-Aware Layout to Image Generation with Enhanced Object Appearance

A layout to image (L2I) generation model aims to generate a complicated ...
research
09/02/2020

Intrinsic Relationship Reasoning for Small Object Detection

The small objects in images and videos are usually not independent indiv...
research
12/06/2021

Context-Aware Transfer Attacks for Object Detection

Blackbox transfer attacks for image classifiers have been extensively st...
research
08/22/2018

Multidomain Document Layout Understanding using Few Shot Object Detection

We try to address the problem of document layout understanding using a s...
research
08/20/2021

AutoLay: Benchmarking amodal layout estimation for autonomous driving

Given an image or a video captured from a monocular camera, amodal layou...

Please sign up or login with your details

Forgot password? Click here to reset