Interpretable Deep Clustering

06/07/2023
by   Jonathan Svirsky, et al.
0

Clustering is a fundamental learning task widely used as a first step in data analysis. For example, biologists often use cluster assignments to analyze genome sequences, medical records, or images. Since downstream analysis is typically performed at the cluster level, practitioners seek reliable and interpretable clustering models. We propose a new deep-learning framework that predicts interpretable cluster assignments at the instance and cluster levels. First, we present a self-supervised procedure to identify a subset of informative features from each data point. Then, we design a model that predicts cluster assignments and a gate matrix that leads to cluster-level feature selection. We show that the proposed method can reliably predict cluster assignments using synthetic and real data. Furthermore, we verify that our model leads to interpretable results at a sample and cluster level.

READ FULL TEXT

page 2

page 7

page 8

research
04/03/2021

Graph Contrastive Clustering

Recently, some contrastive learning methods have been proposed to simult...
research
04/21/2023

Deep Multiview Clustering by Contrasting Cluster Assignments

Multiview clustering (MVC) aims to reveal the underlying structure of mu...
research
03/29/2023

Hard Regularization to Prevent Collapse in Online Deep Clustering without Data Augmentation

Online deep clustering refers to the joint use of a feature extraction n...
research
10/09/2020

Cluster Activation Mapping with Applications to Medical Imaging

An open question in deep clustering is how to understand what in the ima...
research
07/26/2018

Selective Clustering Annotated using Modes of Projections

Selective clustering annotated using modes of projections (SCAMP) is a n...
research
10/19/2022

Functional data clustering via information maximization

A new method for clustering functional data is proposed via information ...
research
09/21/2022

Algorithm-Agnostic Interpretations for Clustering

A clustering outcome for high-dimensional data is typically interpreted ...

Please sign up or login with your details

Forgot password? Click here to reset