A Survey on Long-Tailed Visual Recognition

05/27/2022
by   Lu Yang, et al.
0

The heavy reliance on data is one of the major reasons that currently limit the development of deep learning. Data quality directly dominates the effect of deep learning models, and the long-tailed distribution is one of the factors affecting data quality. The long-tailed phenomenon is prevalent due to the prevalence of power law in nature. In this case, the performance of deep learning models is often dominated by the head classes while the learning of the tail classes is severely underdeveloped. In order to learn adequately for all classes, many researchers have studied and preliminarily addressed the long-tailed problem. In this survey, we focus on the problems caused by long-tailed data distribution, sort out the representative long-tailed visual recognition datasets and summarize some mainstream long-tailed studies. Specifically, we summarize these studies into ten categories from the perspective of representation learning, and outline the highlights and limitations of each category. Besides, we have studied four quantitative metrics for evaluating the imbalance, and suggest using the Gini coefficient to evaluate the long-tailedness of a dataset. Based on the Gini coefficient, we quantitatively study 20 widely-used and large-scale visual datasets proposed in the last decade, and find that the long-tailed phenomenon is widespread and has not been fully studied. Finally, we provide several future directions for the development of long-tailed learning to provide more ideas for readers.

READ FULL TEXT

page 2

page 24

research
10/09/2021

Deep Long-Tailed Learning: A Survey

Deep long-tailed learning, one of the most challenging problems in visua...
research
07/17/2023

HeroLT: Benchmarking Heterogeneous Long-Tailed Learning

Long-tailed data distributions are prevalent in a variety of domains, in...
research
07/08/2021

Investigate the Essence of Long-Tailed Recognition from a Unified Perspective

As the data scale grows, deep recognition models often suffer from long-...
research
09/07/2023

The Devil is in the Tails: How Long-Tailed Code Distributions Impact Large Language Models

Learning-based techniques, especially advanced Large Language Models (LL...
research
09/28/2020

Long-Tailed Classification by Keeping the Good and Removing the Bad Momentum Causal Effect

As the class size grows, maintaining a balanced dataset across many clas...
research
07/06/2021

Predicate correlation learning for scene graph generation

For a typical Scene Graph Generation (SGG) method, there is often a larg...
research
08/09/2020

What Neural Networks Memorize and Why: Discovering the Long Tail via Influence Estimation

Deep learning algorithms are well-known to have a propensity for fitting...

Please sign up or login with your details

Forgot password? Click here to reset