Partial Multi-label Learning with Label and Feature Collaboration

by   Tingting Yu, et al.

Partial multi-label learning (PML) models the scenario where each training instance is annotated with a set of candidate labels, and only some of the labels are relevant. The PML problem is practical in real-world scenarios, as it is difficult and even impossible to obtain precisely labeled samples. Several PML solutions have been proposed to combat with the prone misled by the irrelevant labels concealed in the candidate labels, but they generally focus on the smoothness assumption in feature space or low-rank assumption in label space, while ignore the negative information between features and labels. Specifically, if two instances have largely overlapped candidate labels, irrespective of their feature similarity, their ground-truth labels should be similar; while if they are dissimilar in the feature and candidate label space, their ground-truth labels should be dissimilar with each other. To achieve a credible predictor on PML data, we propose a novel approach called PML-LFC (Partial Multi-label Learning with Label and Feature Collaboration). PML-LFC estimates the confidence values of relevant labels for each instance using the similarity from both the label and feature spaces, and trains the desired predictor with the estimated confidence values. PML-LFC achieves the predictor and the latent label matrix in a reciprocal reinforce manner by a unified model, and develops an alternative optimization procedure to optimize them. Extensive empirical study on both synthetic and real-world datasets demonstrates the superiority of PML-LFC.


page 2

page 3

page 14


Recovering Accurate Labeling Information from Partially Valid Data for Effective Multi-Label Learning

Partial Multi-label Learning (PML) aims to induce the multi-label predic...

Estimating Multi-label Accuracy using Labelset Distributions

A multi-label classifier estimates the binary label state (relevant vs i...

Disambiguated Attention Embedding for Multi-Instance Partial-Label Learning

In many real-world tasks, the concerned objects can be represented as a ...

Multi-Level Generative Models for Partial Label Learning with Non-random Label Noise

Partial label (PL) learning tackles the problem where each training inst...

Mitigating Label Noise through Data Ambiguation

Label noise poses an important challenge in machine learning, especially...

Exploratory Machine Learning with Unknown Unknowns

In conventional supervised learning, a training dataset is given with gr...

openXDATA: A Tool for Multi-Target Data Generation and Missing Label Completion

A common problem in machine learning is to deal with datasets with disjo...

Please sign up or login with your details

Forgot password? Click here to reset