HoughNet: Integrating near and long-range evidence for visual detection

04/14/2021
by   Nermin Samet, et al.
5

This paper presents HoughNet, a one-stage, anchor-free, voting-based, bottom-up object detection method. Inspired by the Generalized Hough Transform, HoughNet determines the presence of an object at a certain location by the sum of the votes cast on that location. Votes are collected from both near and long-distance locations based on a log-polar vote field. Thanks to this voting mechanism, HoughNet is able to integrate both near and long-range, class-conditional evidence for visual recognition, thereby generalizing and enhancing current object detection methodology, which typically relies on only local evidence. On the COCO dataset, HoughNet's best model achieves 46.4 AP (and 65.1 AP_50), performing on par with the state-of-the-art in bottom-up object detection and outperforming most major one-stage and two-stage methods. We further validate the effectiveness of our proposal in other visual detection tasks, namely, video object detection, instance segmentation, 3D object detection and keypoint detection for human pose estimation, and an additional “labels to photo“ image generation task, where the integration of our voting module consistently improves performance in all cases. Code is available at <https://github.com/nerminsamet/houghnet>.

READ FULL TEXT

page 1

page 9

page 11

page 12

research
07/05/2020

HoughNet: Integrating near and long-range evidence for bottom-up object detection

This paper presents HoughNet, a one-stage, anchor-free, voting-based, bo...
research
04/11/2021

Location-Sensitive Visual Recognition with Cross-IOU Loss

Object detection, instance segmentation, and pose estimation are popular...
research
07/27/2020

Corner Proposal Network for Anchor-free, Two-stage Object Detection

The goal of object detection is to determine the class and location of o...
research
04/17/2019

CenterNet: Keypoint Triplets for Object Detection

In object detection, keypoint-based approaches often suffer a large numb...
research
07/27/2021

Is Object Detection Necessary for Human-Object Interaction Recognition?

This paper revisits human-object interaction (HOI) recognition at image ...
research
10/11/2021

UrbanNet: Leveraging Urban Maps for Long Range 3D Object Detection

Relying on monocular image data for precise 3D object detection remains ...
research
12/07/2020

Rethinking Learnable Tree Filter for Generic Feature Transform

The Learnable Tree Filter presents a remarkable approach to model struct...

Please sign up or login with your details

Forgot password? Click here to reset