Self-Supervised Learning for Fine-Grained Visual Categorization

05/18/2021
by   Muhammad Maaz, et al.
10

Recent research in self-supervised learning (SSL) has shown its capability in learning useful semantic representations from images for classification tasks. Through our work, we study the usefulness of SSL for Fine-Grained Visual Categorization (FGVC). FGVC aims to distinguish objects of visually similar sub categories within a general category. The small inter-class, but large intra-class variations within the dataset makes it a challenging task. The limited availability of annotated labels for such a fine-grained data encourages the need for SSL, where additional supervision can boost learning without the cost of extra annotations. Our baseline achieves 86.36% top-1 classification accuracy on CUB-200-2011 dataset by utilizing random crop augmentation during training and center crop augmentation during testing. In this work, we explore the usefulness of various pretext tasks, specifically, rotation, pretext invariant representation learning (PIRL), and deconstruction and construction learning (DCL) for FGVC. Rotation as an auxiliary task promotes the model to learn global features, and diverts it from focusing on the subtle details. PIRL that uses jigsaw patches attempts to focus on discriminative local regions, but struggles to accurately localize them. DCL helps in learning local discriminating features and outperforms the baseline by achieving 87.41% top-1 accuracy. The deconstruction learning forces the model to focus on local object parts, while reconstruction learning helps in learning the correlation between the parts. We perform extensive experiments to reason our findings. Our code is available at https://github.com/mmaaz60/ssl_for_fgvc.

READ FULL TEXT

page 2

page 4

page 5

page 7

page 10

page 14

research
03/30/2022

Fine-Grained Object Classification via Self-Supervised Pose Alignment

Semantic patterns of fine-grained objects are determined by subtle appea...
research
07/29/2021

Self-Supervised Learning for Fine-Grained Image Classification

Fine-grained image classification involves identifying different subcate...
research
06/16/2018

Part-Aware Fine-grained Object Categorization using Weakly Supervised Part Detection Network

Fine-grained object categorization aims for distinguishing objects of su...
research
05/26/2022

On the Eigenvalues of Global Covariance Pooling for Fine-grained Visual Recognition

The Fine-Grained Visual Categorization (FGVC) is challenging because the...
research
03/03/2023

Learning Common Rationale to Improve Self-Supervised Representation for Fine-Grained Visual Recognition Problems

Self-supervised learning (SSL) strategies have demonstrated remarkable p...
research
08/03/2022

Convolutional Fine-Grained Classification with Self-Supervised Target Relation Regularization

Fine-grained visual classification can be addressed by deep representati...
research
11/11/2021

Unsupervised Part Discovery from Contrastive Reconstruction

The goal of self-supervised visual representation learning is to learn s...

Please sign up or login with your details

Forgot password? Click here to reset