CiPR: An Efficient Framework with Cross-instance Positive Relations for Generalized Category Discovery

04/14/2023
by   Shaozhe Hao, et al.
0

We tackle the issue of generalized category discovery (GCD). GCD considers the open-world problem of automatically clustering a partially labelled dataset, in which the unlabelled data contain instances from novel categories and also the labelled classes. In this paper, we address the GCD problem without a known category number in the unlabelled data. We propose a framework, named CiPR, to bootstrap the representation by exploiting Cross-instance Positive Relations for contrastive learning in the partially labelled data which are neglected in existing methods. First, to obtain reliable cross-instance relations to facilitate the representation learning, we introduce a semi-supervised hierarchical clustering algorithm, named selective neighbor clustering (SNC), which can produce a clustering hierarchy directly from the connected components in the graph constructed by selective neighbors. We also extend SNC to be capable of label assignment for the unlabelled instances with the given class number. Moreover, we present a method to estimate the unknown class number using SNC with a joint reference score considering clustering indexes of both labelled and unlabelled data. Finally, we thoroughly evaluate our framework on public generic image recognition datasets and challenging fine-grained datasets, all establishing the new state-of-the-art.

READ FULL TEXT

page 1

page 4

page 11

page 15

research
01/07/2022

Generalized Category Discovery

In this paper, we consider a highly general image recognition setting wh...
research
06/29/2021

AutoNovel: Automatically Discovering and Learning Novel Visual Categories

We tackle the problem of discovering novel classes in an image collectio...
research
04/26/2021

Joint Representation Learning and Novel Category Discovery on Single- and Multi-modal Data

This paper studies the problem of novel category discovery on single- an...
research
10/13/2020

On the Efficiency of K-Means Clustering: Evaluation, Optimization, and Algorithm Selection

This paper presents a thorough evaluation of the existing methods that a...
research
02/13/2020

Automatically Discovering and Learning New Visual Categories with Ranking Statistics

We tackle the problem of discovering novel classes in an image collectio...
research
07/07/2021

Novel Visual Category Discovery with Dual Ranking Statistics and Mutual Knowledge Distillation

In this paper, we tackle the problem of novel visual category discovery,...
research
07/07/2023

Novel Categories Discovery from probability matrix perspective

Novel Categories Discovery (NCD) tackles the open-world problem of class...

Please sign up or login with your details

Forgot password? Click here to reset