Improving the performance of object detection by preserving label distribution

by Heewon Lee, et al.

Object detection is the task of locating objects in images or videos and classifying them by label. The information obtained through this process plays an essential role in many computer vision tasks. The data used for training and validation in object detection typically come from public datasets that are well balanced in the number of objects per class in an image. In real-world scenarios, however, datasets with much greater class imbalance, i.e., very different numbers of objects for each class, are far more common, and this imbalance may reduce detection performance on unseen test images. We therefore propose a method that distributes the classes evenly across the images used for training and validation, addressing the class imbalance problem in object detection. The method aims to maintain a uniform class distribution through multi-label stratification. We tested it not only on public datasets, which typically exhibit a balanced class distribution, but also on custom datasets that may have an imbalanced class distribution. We found that the method was more effective on datasets with severe imbalance and fewer samples. Our findings indicate that the proposed method can be used effectively on datasets with substantially imbalanced class distributions.
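The abstract does not spell out the stratification procedure, so the following is only a minimal sketch of one way a multi-label stratified train/validation split could work. It assumes each image is described by the list of class labels of its objects, and it uses a simplified greedy scheme (rarest class first, each image sent to the split that still needs that class most). The names `stratified_split`, `val_share_per_class`, and the toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter


def stratified_split(image_labels, val_fraction=0.2):
    """Greedy multi-label stratified split (illustrative assumption, not the paper's code).

    image_labels: dict mapping image id -> list of class labels, one per object.
    Returns a dict mapping image id -> "train" or "val".
    """
    counts = {img: Counter(labels) for img, labels in image_labels.items()}
    totals = Counter()
    for c in counts.values():
        totals.update(c)

    fractions = {"train": 1.0 - val_fraction, "val": val_fraction}
    # Remaining desired number of objects per split and class.
    need = {s: {cls: f * n for cls, n in totals.items()} for s, f in fractions.items()}
    # Remaining desired number of images per split, used as a tie-breaker.
    need_images = {s: f * len(counts) for s, f in fractions.items()}

    assignment = {}
    # Handle the rarest classes first, since they are the hardest to balance.
    for cls in sorted(totals, key=lambda k: (totals[k], k)):
        candidates = [i for i in counts if i not in assignment and counts[i][cls] > 0]
        candidates.sort(key=lambda i: (-counts[i][cls], i))
        for img in candidates:
            # Send the image to the split that still needs this class the most.
            split = max(fractions, key=lambda s: (need[s][cls], need_images[s]))
            assignment[img] = split
            for k, n in counts[img].items():
                need[split][k] -= n
            need_images[split] -= 1

    # Images without any objects only affect the image-count balance.
    for img in counts:
        if img not in assignment:
            split = max(fractions, key=lambda s: need_images[s])
            assignment[img] = split
            need_images[split] -= 1
    return assignment


def val_share_per_class(image_labels, assignment):
    """Fraction of each class's objects that landed in the validation split."""
    total, val = Counter(), Counter()
    for img, labels in image_labels.items():
        total.update(labels)
        if assignment[img] == "val":
            val.update(labels)
    return {cls: val[cls] / total[cls] for cls in total}


# Toy, deliberately imbalanced dataset (made up for illustration).
toy = {}
for i in range(20):
    toy[f"img{i:02d}"] = ["car", "car"]
for i in range(20, 30):
    toy[f"img{i:02d}"] = ["car", "dog"]
for i in range(30, 35):
    toy[f"img{i:02d}"] = ["bike"]

assignment = stratified_split(toy, val_fraction=0.2)
shares = val_share_per_class(toy, assignment)
for cls, share in sorted(shares.items()):
    print(f"{cls}: {share:.2f} of objects in validation")
```

A purely random split of the same toy data could easily leave the five `bike` images entirely in one split; the greedy assignment keeps each class's validation share close to the requested fraction, which is the property the abstract attributes to preserving the label distribution.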


