Residual Attention: A Simple but Effective Method for Multi-Label Recognition

08/05/2021
by Ke Zhu, et al.

Multi-label image recognition is a challenging computer vision task of practical use. Progress in this area, however, is often characterized by complicated methods, heavy computation, and a lack of intuitive explanations. To effectively capture the different spatial regions occupied by objects from different categories, we propose an embarrassingly simple module, named class-specific residual attention (CSRA). CSRA generates class-specific features for every category by proposing a simple spatial attention score, and then combines these features with the class-agnostic average pooling feature. CSRA achieves state-of-the-art results on multi-label recognition, and at the same time is much simpler than competing methods. Furthermore, with only 4 lines of code, CSRA also leads to consistent improvement across many diverse pretrained models and datasets without any extra training. CSRA is both easy to implement and light in computation, and it also enjoys intuitive explanations and visualizations.
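
The idea of combining a class-specific attention branch with class-agnostic average pooling can be illustrated with a minimal sketch. The abstract does not spell out the formulation, so the details below are assumptions: each class has a linear classifier, the attention weights are a softmax over per-location class scores, and the two branches are added with a weight `lam`. The function name `csra_logits` and the parameters `lam` and `temperature` are hypothetical, chosen for illustration, and this is not the authors' reference code.

```python
import math

def csra_logits(features, classifiers, lam=0.2, temperature=1.0):
    """Sketch of residual attention pooling (assumed formulation).

    features:    list of feature vectors, one per spatial location
    classifiers: list of weight vectors, one per class
    Returns a list with one logit per class.
    """
    n = len(features)
    logits = []
    for w in classifiers:
        # Score of this class at every spatial location (dot product).
        scores = [sum(wi * xi for wi, xi in zip(w, x)) for x in features]
        # Class-agnostic branch: by linearity, classifying the
        # average-pooled feature equals averaging the location scores.
        avg = sum(scores) / n
        # Class-specific branch: softmax attention over locations,
        # shifted by the max score for numerical stability.
        m = max(scores)
        exps = [math.exp(temperature * (s - m)) for s in scores]
        z = sum(exps)
        att = sum((e / z) * s for e, s in zip(exps, scores))
        # Residual combination of the two branches.
        logits.append(avg + lam * att)
    return logits

# Two locations, two classes; class 0 fires only at the first location.
feats = [[1.0, 0.0], [0.0, 1.0]]
clfs = [[1.0, 0.0], [0.0, 0.0]]
out = csra_logits(feats, clfs)
sharp = csra_logits(feats, clfs, temperature=1000.0)
```

Under these assumptions, a very large `temperature` makes the attention branch behave like max pooling over locations, so the logit approaches the average score plus `lam` times the maximum score.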

research
07/03/2020

Multi-Label Image Recognition with Multi-Class Attentional Regions

Multi-label image recognition is a practical and challenging task compar...
research
04/08/2022

Semantic Representation and Dependency Learning for Multi-Label Image Recognition

Recently many multi-label image recognition (MLR) works have made signif...
research
12/13/2018

ELASTIC: Improving CNNs with Instance Specific Scaling Policies

Scale variation has been a challenge from traditional to modern approach...
research
12/20/2017

Recurrent Attentional Reinforcement Learning for Multi-label Image Recognition

Recognizing multiple labels of images is a fundamental but challenging t...
research
08/03/2023

DualCoOp++: Fast and Effective Adaptation to Multi-Label Recognition with Limited Annotations

Multi-label image recognition in the low-label regime is a task of great...
research
07/10/2018

Deep Imbalanced Attribute Classification using Visual Attention Aggregation

For many computer vision applications such as image description and huma...
research
08/18/2020

Mastering Large Scale Multi-label Image Recognition with high efficiency over Camera trap images

Camera traps are crucial in biodiversity motivated studies, however deal...
