Sparse MDOD: Training End-to-End Multi-Object Detector without Bipartite Matching

05/18/2022
by   Jaeyoung Yoo, et al.
0

Recent end-to-end multi-object detectors simplify the inference pipeline by removing the hand-crafted process such as the duplicate bounding box removal using non-maximum suppression (NMS). However, in the training, they require bipartite matching to calculate the loss from the output of the detector. Contrary to the directivity of the end-to-end method, the bipartite matching makes the training of the end-to-end detector complex, heuristic, and reliant. In this paper, we aim to propose a method to train the end-to-end multi-object detector without bipartite matching. To this end, we approach end-to-end multi-object detection as a density estimation using a mixture model. Our proposed detector, called Sparse Mixture Density Object Detector (Sparse MDOD) estimates the distribution of bounding boxes using a mixture model. Sparse MDOD is trained by minimizing the negative log-likelihood and our proposed regularization term, maximum component maximization (MCM) loss that prevents duplicated predictions. During training, no additional procedure such as bipartite matching is needed, and the loss is directly computed from the network outputs. Moreover, our Sparse MDOD outperforms the existing detectors on MS-COCO, a renowned multi-object detection benchmark.

READ FULL TEXT
research
11/28/2019

Mixture-Model-based Bounding Box Density Estimation for Object Detection

In this paper, we propose a new object detection model, Mixture-Model-ba...
research
06/15/2023

DEYOv2: Rank Feature with Greedy Matching for End-to-End Object Detection

This paper presents a novel object detector called DEYOv2, an improved v...
research
07/12/2016

End-to-end training of object class detectors for mean average precision

We present a method for training CNN-based object class detectors direct...
research
12/12/2022

NMS Strikes Back

Detection Transformer (DETR) directly transforms queries to unique objec...
research
03/18/2021

SparsePoint: Fully End-to-End Sparse 3D Object Detector

Object detectors based on sparse object proposals have recently been pro...
research
06/04/2021

NMS-Loss: Learning with Non-Maximum Suppression for Crowded Pedestrian Detection

Non-Maximum Suppression (NMS) is essential for object detection and affe...
research
05/08/2017

Learning non-maximum suppression

Object detectors have hugely profited from moving towards an end-to-end ...

Please sign up or login with your details

Forgot password? Click here to reset