Label Disentanglement in Partition-based Extreme Multilabel Classification

06/24/2021
by   Xuanqing Liu, et al.
0

Partition-based methods are increasingly-used in extreme multi-label classification (XMC) problems due to their scalability to large output spaces (e.g., millions or more). However, existing methods partition the large label space into mutually exclusive clusters, which is sub-optimal when labels have multi-modality and rich semantics. For instance, the label "Apple" can be the fruit or the brand name, which leads to the following research question: can we disentangle these multi-modal labels with non-exclusive clustering tailored for downstream XMC tasks? In this paper, we show that the label assignment problem in partition-based XMC can be formulated as an optimization problem, with the objective of maximizing precision rates. This leads to an efficient algorithm to form flexible and overlapped label clusters, and a method that can alternatively optimizes the cluster assignments and the model parameters for partition-based XMC. Experimental results on synthetic and real datasets show that our method can successfully disentangle multi-modal labels, leading to state-of-the-art (SOTA) results on four XMC benchmarks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/16/2022

End-to-End Learning to Index and Search in Large Output Spaces

Extreme multi-label classification (XMC) is a popular framework for solv...
research
09/23/2016

A Novel Progressive Multi-label Classifier for Classincremental Data

In this paper, a progressive learning algorithm for multi-label classifi...
research
02/25/2022

On Modality Bias Recognition and Reduction

Making each modality in multi-modal data contribute is of vital importan...
research
11/04/2018

Block-wise Partitioning for Extreme Multi-label Classification

Extreme multi-label classification aims to learn a classifier that annot...
research
03/05/2018

Adversarial Extreme Multi-label Classification

The goal in extreme multi-label classification is to learn a classifier ...
research
04/17/2021

Semi-Supervised Multi-Modal Multi-Instance Multi-Label Deep Network with Optimal Transport

Complex objects are usually with multiple labels, and can be represented...
research
09/18/2019

Efficient Computation of Multi-Modal Public Transit Traffic Assignments using ULTRA

We study the problem of computing public transit traffic assignments in ...

Please sign up or login with your details

Forgot password? Click here to reset