Partial Multi-label Learning with Label and Feature Collaboration

03/17/2020
by Tingting Yu, et al.

Partial multi-label learning (PML) models the scenario where each training instance is annotated with a set of candidate labels, only some of which are relevant. The PML problem is practical in real-world scenarios, as it is difficult or even impossible to obtain precisely labeled samples. Several PML solutions have been proposed to avoid being misled by the irrelevant labels concealed among the candidate labels, but they generally focus on the smoothness assumption in the feature space or the low-rank assumption in the label space, while ignoring the negative information between features and labels. Specifically, if two instances have largely overlapping candidate labels, then irrespective of their feature similarity, their ground-truth labels should be similar; whereas if they are dissimilar in both the feature and candidate label spaces, their ground-truth labels should be dissimilar. To achieve a credible predictor on PML data, we propose a novel approach called PML-LFC (Partial Multi-label Learning with Label and Feature Collaboration). PML-LFC estimates the confidence values of relevant labels for each instance using the similarity from both the label and feature spaces, and trains the desired predictor with the estimated confidence values. PML-LFC obtains the predictor and the latent label matrix in a mutually reinforcing manner through a unified model, and develops an alternating optimization procedure to optimize them. An extensive empirical study on both synthetic and real-world datasets demonstrates the superiority of PML-LFC.
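To make the idea of estimating label confidence from both spaces concrete, the following is a minimal sketch and not the PML-LFC model itself. It assumes cosine similarity in both the feature space and the candidate label space, blends the two with a hypothetical weight `alpha`, and propagates confidence among neighbors while keeping non-candidate labels at zero. The function names, the parameters `alpha`, `mu`, and `n_iter`, and the propagation rule are all illustrative assumptions. The sketch does not model the dissimilarity (negative) information or the predictor training described in the abstract.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm > 0 else 0.0


def estimate_confidence(X, Y, alpha=0.5, mu=0.5, n_iter=10):
    """Illustrative confidence estimation (an assumption, not the authors' objective).

    X: list of feature vectors, one per instance.
    Y: list of 0/1 candidate-label vectors, one per instance.
    alpha: hypothetical weight on label-space similarity vs. feature-space similarity.
    mu: hypothetical weight on neighbor evidence vs. the original candidate set.
    """
    n, q = len(X), len(Y[0])

    # Blend label-space and feature-space similarity, then row-normalize.
    W = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                s = alpha * cosine(Y[i], Y[j]) + (1 - alpha) * cosine(X[i], X[j])
                W[i][j] = max(s, 0.0)  # ignore negative similarity in this sketch
        row_sum = sum(W[i])
        if row_sum > 0:
            W[i] = [w / row_sum for w in W[i]]

    # Start from the candidate labels and repeatedly mix in neighbor confidence.
    P = [[float(y) for y in row] for row in Y]
    for _ in range(n_iter):
        Q = [[sum(W[i][j] * P[j][c] for j in range(n)) for c in range(q)]
             for i in range(n)]
        # Multiplying by Y[i][c] keeps confidence at zero outside the candidate set.
        P = [[Y[i][c] * (mu * Q[i][c] + (1 - mu) * Y[i][c]) for c in range(q)]
             for i in range(n)]
    return P


# Toy data: instances 0 and 1 are close in feature space and share label 0.
X = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
Y = [[1, 1, 0], [1, 0, 0], [0, 1, 1]]
conf = estimate_confidence(X, Y)
```

In this toy setting, instance 0 has candidate labels 0 and 1; because its nearest neighbor supports only label 0, the sketch assigns label 0 a higher confidence than label 1, which is the kind of disambiguation the abstract describes.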


research
06/20/2020

Recovering Accurate Labeling Information from Partially Valid Data for Effective Multi-Label Learning

Partial Multi-label Learning (PML) aims to induce the multi-label predic...
research
09/09/2022

Estimating Multi-label Accuracy using Labelset Distributions

A multi-label classifier estimates the binary label state (relevant vs i...
research
05/26/2023

Disambiguated Attention Embedding for Multi-Instance Partial-Label Learning

In many real-world tasks, the concerned objects can be represented as a ...
research
05/11/2020

Multi-Level Generative Models for Partial Label Learning with Non-random Label Noise

Partial label (PL) learning tackles the problem where each training inst...
research
05/23/2023

Mitigating Label Noise through Data Ambiguation

Label noise poses an important challenge in machine learning, especially...
research
02/05/2020

Exploratory Machine Learning with Unknown Unknowns

In conventional supervised learning, a training dataset is given with gr...
research
07/27/2020

openXDATA: A Tool for Multi-Target Data Generation and Missing Label Completion

A common problem in machine learning is to deal with datasets with disjo...
