Image Clustering via the Principle of Rate Reduction in the Age of Pretrained Models

06/08/2023
by   Tianzhe Chu, et al.
0

The advent of large pre-trained models has brought about a paradigm shift in both visual representation learning and natural language processing. However, clustering unlabeled images, as a fundamental and classic machine learning problem, still lacks effective solution, particularly for large-scale datasets. In this paper, we propose a novel image clustering pipeline that leverages the powerful feature representation of large pre-trained models such as CLIP and cluster images effectively and efficiently at scale. We show that the pre-trained features are significantly more structured by further optimizing the rate reduction objective. The resulting features may significantly improve the clustering accuracy, e.g., from 57% to 66% on ImageNet-1k. Furthermore, by leveraging CLIP's image-text binding, we show how the new clustering method leads to a simple yet effective self-labeling algorithm that successfully works on unlabeled large datasets such as MS-COCO and LAION-Aesthetics. We will release the code in https://github.com/LeslieTrue/CPP.

READ FULL TEXT

page 2

page 8

page 9

page 17

page 18

page 19

page 20

page 21

research
03/23/2023

Exploring Visual Prompts for Whole Slide Image Classification with Multiple Instance Learning

Multiple instance learning (MIL) has emerged as a popular method for cla...
research
05/19/2017

CNN-Based Joint Clustering and Representation Learning with Feature Drift Compensation for Large-Scale Image Data

Given a large unlabeled set of images, how to efficiently and effectivel...
research
03/23/2023

CrOC: Cross-View Online Clustering for Dense Visual Representation Learning

Learning dense visual representations without labels is an arduous task ...
research
12/02/2021

DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Recent progress has shown that large-scale pre-training using contrastiv...
research
04/12/2023

Unicom: Universal and Compact Representation Learning for Image Retrieval

Modern image retrieval methods typically rely on fine-tuning pre-trained...
research
01/19/2019

Deep Representation Learning Characterized by Inter-class Separation for Image Clustering

Despite significant advances in clustering methods in recent years, the ...
research
07/15/2023

Can Pre-Trained Text-to-Image Models Generate Visual Goals for Reinforcement Learning?

Pre-trained text-to-image generative models can produce diverse, semanti...

Please sign up or login with your details

Forgot password? Click here to reset