Crowdsourcing via Annotator Co-occurrence Imputation and Provable Symmetric Nonnegative Matrix Factorization

06/14/2021
by Shahana Ibrahim, et al.

Unsupervised learning of the Dawid-Skene (D&S) model from noisy, incomplete, and crowdsourced annotations has been a long-standing challenge, and is a critical step towards reliably labeling massive data. A recent work takes a coupled nonnegative matrix factorization (CNMF) perspective and shows appealing features: it ensures the identifiability of the D&S model and enjoys low sample complexity, as only the estimates of the co-occurrences of annotator labels are involved. However, the identifiability holds only when certain somewhat restrictive conditions are met in the context of crowdsourcing. Optimizing the CNMF criterion is also costly, and convergence assurances are elusive. This work recasts the pairwise co-occurrence based D&S model learning problem as a symmetric NMF (SymNMF) problem, which offers enhanced identifiability relative to CNMF. In practice, the SymNMF model is often (largely) incomplete, because some pairs of annotators have no co-labeled items. Two lightweight algorithms are proposed for co-occurrence imputation. Then, a low-complexity shifted rectified linear unit (ReLU)-empowered SymNMF algorithm is proposed to identify the D&S model. Various performance characterizations (e.g., missing co-occurrence recoverability, stability, and convergence) and evaluations are also presented.
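
To make the notion of "co-occurrences of annotator labels" concrete, the following is a minimal Python sketch of how an empirical pairwise co-occurrence matrix could be estimated, and how a missing block arises when two annotators share no items. It is an illustration only, not the paper's algorithm: the function name `cooccurrence`, the use of `-1` to mark an unlabeled item, and the toy label vectors are all assumptions made for this example. The imputation and SymNMF steps described in the abstract are not shown.

```python
import numpy as np

def cooccurrence(labels_m, labels_l, num_classes, missing=-1):
    """Empirical K x K co-occurrence of two annotators' labels.

    Entry (i, j) is the fraction of co-labeled items on which the first
    annotator answered class i and the second answered class j.
    Returns None when the two annotators have no co-labeled items,
    i.e. the corresponding block is unobserved.
    (Hypothetical helper; `missing=-1` is an assumed encoding.)
    """
    labels_m = np.asarray(labels_m)
    labels_l = np.asarray(labels_l)
    # Keep only items that both annotators labeled.
    both = (labels_m != missing) & (labels_l != missing)
    if not both.any():
        return None
    counts = np.zeros((num_classes, num_classes))
    # Unbuffered accumulation so repeated (i, j) pairs are all counted.
    np.add.at(counts, (labels_m[both], labels_l[both]), 1.0)
    return counts / counts.sum()

# Toy data: 5 items, 2 classes, -1 means "not labeled by this annotator".
a = [0, 1, 1, -1, 0]
b = [0, 1, 0, 1, -1]
c = [-1, -1, -1, 1, -1]

R_ab = cooccurrence(a, b, num_classes=2)  # estimated from items 0, 1, 2
R_ac = cooccurrence(a, c, num_classes=2)  # None: no co-labeled items
```

In this toy setting the pair (a, c) yields no estimate at all, which is the kind of gap that co-occurrence imputation is meant to fill before factorization.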

research
09/02/2017

On Identifiability of Nonnegative Matrix Factorization

In this letter, we propose a new identification criterion that guarantee...
research
06/17/2019

A Provably Correct and Robust Algorithm for Convolutive Nonnegative Matrix Factorization

In this paper, we propose a provably correct algorithm for convolutive n...
research
09/26/2019

Crowdsourcing via Pairwise Co-occurrences: Identifiability and Algorithms

The data deluge comes with high demands for data labeling. Crowdsourcing...
research
11/14/2018

Dropping Symmetry for Fast Symmetric Nonnegative Matrix Factorization

Symmetric nonnegative matrix factorization (NMF), a special but importan...
research
08/28/2018

Matrix Factorization Equals Efficient Co-occurrence Representation

Matrix factorization is a simple and effective solution to the recommend...
research
05/30/2023

Deep Clustering with Incomplete Noisy Pairwise Annotations: A Geometric Regularization Approach

The recent integration of deep learning and pairwise similarity annotati...
