Reasoning for Complex Data through Ensemble-based Self-Supervised Learning

02/07/2022
by   Gabriel Bertocco, et al.
9

Self-supervised learning deals with problems that have little or no available labeled data. Recent work has shown impressive results when underlying classes have significant semantic differences. One important dataset in which this technique thrives is ImageNet, as intra-class distances are substantially lower than inter-class distances. However, this is not the case for several critical tasks, and general self-supervised learning methods fail to learn discriminative features when classes have closer semantics, thus requiring more robust strategies. We propose a strategy to tackle this problem, and to enable learning from unlabeled data even when samples from different classes are not prominently diverse. We approach the problem by leveraging a novel ensemble-based clustering strategy where clusters derived from different configurations are combined to generate a better grouping for the data samples in a fully-unsupervised way. This strategy allows clusters with different densities and higher variability to emerge, which in turn reduces intra-class discrepancies, without requiring the burden of finding an optimal configuration per dataset. We also consider different Convolutional Neural Networks to compute distances between samples. We refine these distances by performing context analysis and group them to capture complementary information. We consider two applications to validate our pipeline: Person Re-Identification and Text Authorship Verification. These are challenging applications considering that classes are semantically close to each other and that training and test sets have disjoint identities. Our method is robust across different modalities and outperforms state-of-the-art results with a fully-unsupervised solution without any labeling or human intervention.

READ FULL TEXT

page 1

page 9

page 15

page 16

research
08/25/2020

Learning to Learn in a Semi-Supervised Fashion

To address semi-supervised learning from both labeled and unlabeled data...
research
06/13/2020

Adversarial Self-Supervised Contrastive Learning

Existing adversarial learning approaches mostly use class labels to gene...
research
07/26/2023

Large-scale Fully-Unsupervised Re-Identification

Fully-unsupervised Person and Vehicle Re-Identification have received in...
research
04/26/2021

Multimodal Clustering Networks for Self-supervised Learning from Unlabeled Videos

Multimodal self-supervised learning is getting more and more attention a...
research
06/19/2023

Graph Self-Supervised Learning for Endoscopic Image Matching

Accurate feature matching and correspondence in endoscopic images play a...
research
12/06/2020

Art Style Classification with Self-Trained Ensemble of AutoEncoding Transformations

The artistic style of a painting is a rich descriptor that reveals both ...
research
07/22/2020

CrossTransformers: spatially-aware few-shot transfer

Given new tasks with very little data–such as new classes in a classific...

Please sign up or login with your details

Forgot password? Click here to reset