MultiDEC: Multi-Modal Clustering of Image-Caption Pairs

01/04/2019
by   Sean Yang, et al.
0

In this paper, we propose a method for clustering image-caption pairs by simultaneously learning image representations and text representations that are constrained to exhibit similar distributions. These image-caption pairs arise frequently in high-value applications where structured training data is expensive to produce but free-text descriptions are common. MultiDEC initializes parameters with stacked autoencoders, then iteratively minimizes the Kullback-Leibler divergence between the distribution of the images (and text) to that of a combined joint target distribution. We regularize by penalizing non-uniform distributions across clusters. The representations that minimize this objective produce clusters that outperform both single-view and multi-view techniques on large benchmark image-caption datasets.

READ FULL TEXT
research
07/14/2023

MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System

Multi-modal sarcasm detection has attracted much recent attention. Never...
research
10/02/2020

Deep Incomplete Multi-View Multiple Clusterings

Multi-view clustering aims at exploiting information from multiple heter...
research
07/11/2021

Locality Relationship Constrained Multi-view Clustering Framework

In most practical applications, it's common to utilize multiple features...
research
02/17/2023

Multi-View Clustering from the Perspective of Mutual Information

Exploring the complementary information of multi-view data to improve cl...
research
08/01/2023

Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities

Person clustering with multi-modal clues, including faces, bodies, and v...
research
03/13/2021

Reconsidering Representation Alignment for Multi-view Clustering

Aligning distributions of view representations is a core component of to...
research
11/03/2021

LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs

Multi-modal language-vision models trained on hundreds of millions of im...

Please sign up or login with your details

Forgot password? Click here to reset