Active Refinement for Multi-Label Learning: A Pseudo-Label Approach

by   Cheng-Yu Hsieh, et al.
National Taiwan University
The University of Queensland
The University of Tokyo

The goal of multi-label learning (MLL) is to associate a given instance with its relevant labels from a set of concepts. Previous works of MLL mainly focused on the setting where the concept set is assumed to be fixed, while many real-world applications require introducing new concepts into the set to meet new demands. One common need is to refine the original coarse concepts and split them into finer-grained ones, where the refinement process typically begins with limited labeled data for the finer-grained concepts. To address the need, we formalize the problem into a special weakly supervised MLL problem to not only learn the fine-grained concepts efficiently but also allow interactive queries to strategically collect more informative annotations to further improve the classifier. The key idea within our approach is to learn to assign pseudo-labels to the unlabeled entries, and in turn leverage the pseudo-labels to train the underlying classifier and to inform a better query strategy. Experimental results demonstrate that our pseudo-label approach is able to accurately recover the missing ground truth, boosting the prediction performance significantly over the baseline methods and facilitating a competitive active learning strategy.


Pseudo Labels for Single Positive Multi-Label Learning

The cost of data annotation is a substantial impediment for multi-label ...

Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data

Existing text classification methods mainly focus on a fixed label set, ...

Class-Distribution-Aware Pseudo Labeling for Semi-Supervised Multi-Label Learning

Pseudo labeling is a popular and effective method to leverage the inform...

Towards Coarse and Fine-grained Multi-Graph Multi-Label Learning

Multi-graph multi-label learning (Mgml) is a supervised learning framewo...

Pseudo Labels Regularization for Imbalanced Partial-Label Learning

Partial-label learning (PLL) is an important branch of weakly supervised...

SemPPL: Predicting pseudo-labels for better contrastive representations

Learning from large amounts of unsupervised data and a small amount of s...

Towards Automated Real-time Evaluation in Text-based Counseling

Automated real-time evaluation of counselor-client interaction is import...

Please sign up or login with your details

Forgot password? Click here to reset