Adversarial Extreme Multi-label Classification

03/05/2018
by Rohit Babbar, et al.

The goal in extreme multi-label classification is to learn a classifier that can assign a small subset of relevant labels to an instance from an extremely large set of target labels. Datasets in extreme classification exhibit a long tail of labels that have only a small number of positive training instances. In this work, we pose the learning task in extreme classification with a large number of tail labels as learning in the presence of adversarial perturbations. This view motivates a robust optimization framework and its equivalence to a corresponding regularized objective. Under the proposed robustness framework, we demonstrate the efficacy of Hamming loss for tail-label detection in extreme classification. The equivalent regularized objective, in combination with proximal gradient based optimization, performs better than state-of-the-art methods on propensity-scored versions of precision@k and nDCG@k (up to 20% relative improvement over PFastreXML, a leading tree-based approach, and 60% relative improvement over SLEEC, a leading label-embedding approach). Furthermore, we highlight the sub-optimality of a sparse solver in a widely used package for large-scale linear classification, which is interesting in its own right. We also investigate the spectral properties of label graphs, which provides novel insights into the conditions governing the performance of the Hamming loss based one-vs-rest scheme vis-à-vis label-embedding methods.
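To make the phrase "regularized objective, in combination with proximal gradient based optimization" concrete, the following is a minimal sketch for a single label in a one-vs-rest scheme. It is not the authors' implementation. The abstract does not spell out the objective, so the choice of a squared hinge loss with an l1 penalty, the function names `soft_threshold` and `fit_l1_squared_hinge`, the parameter `lam`, and the fixed step size are all assumptions made for illustration.

```python
import numpy as np

def soft_threshold(w, tau):
    """Proximal operator of tau * ||w||_1 (element-wise shrinkage toward zero)."""
    return np.sign(w) * np.maximum(np.abs(w) - tau, 0.0)

def objective(X, s, w, lam):
    """Assumed objective: mean squared hinge loss plus an l1 penalty."""
    margin = np.maximum(1.0 - s * (X @ w), 0.0)
    return np.mean(margin ** 2) + lam * np.sum(np.abs(w))

def fit_l1_squared_hinge(X, s, lam=0.05, n_iter=500):
    """Proximal gradient (ISTA) for one binary label; s holds signs in {-1, +1}."""
    n, d = X.shape
    # Lipschitz constant of the smooth part's gradient: 2 * sigma_max(X)^2 / n.
    lipschitz = max(2.0 * np.linalg.norm(X, 2) ** 2 / n, 1e-12)
    step = 1.0 / lipschitz
    w = np.zeros(d)
    for _ in range(n_iter):
        active = np.maximum(1.0 - s * (X @ w), 0.0)   # instances violating the margin
        grad = -2.0 * (X.T @ (s * active)) / n        # gradient of the squared hinge term
        w = soft_threshold(w - step * grad, step * lam)  # gradient step, then l1 prox
    return w

# Hypothetical toy data: 200 instances, 20 features, 3 of them informative.
rng = np.random.default_rng(0)
X_demo = rng.normal(size=(200, 20))
w_true = np.zeros(20)
w_true[:3] = [2.0, -1.5, 1.0]
s_demo = np.where(X_demo @ w_true >= 0, 1.0, -1.0)
w_demo = fit_l1_squared_hinge(X_demo, s_demo, lam=0.05)
```

In a one-vs-rest setting, one such weight vector would be learned independently per label, which is what makes the scheme easy to parallelize. The shrinkage step is what drives weights to exactly zero and keeps the per-label models sparse.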


