Denoising individual bias for a fairer binary submatrix detection

07/31/2020
by   Changlin Wan, et al.
3

Low rank representation of binary matrix is powerful in disentangling sparse individual-attribute associations, and has received wide applications. Existing binary matrix factorization (BMF) or co-clustering (CC) methods often assume i.i.d background noise. However, this assumption could be easily violated in real data, where heterogeneous row- or column-wise probability of binary entries results in disparate element-wise background distribution, and paralyzes the rationality of existing methods. We propose a binary data denoising framework, namely BIND, which optimizes the detection of true patterns by estimating the row- or column-wise mixture distribution of patterns and disparate background, and eliminating the binary attributes that are more likely from the background. BIND is supported by thoroughly derived mathematical property of the row- and column-wise mixture distributions. Our experiment on synthetic and real-world data demonstrated BIND effectively removes background noise and drastically increases the fairness and accuracy of state-of-the arts BMF and CC methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/11/2019

Low-rank matrix completion and denoising under Poisson noise

This paper considers the problem of estimating a low-rank matrix from th...
research
07/29/2020

Extreme-K categorical samples problem

With histograms as its foundation, we develop Categorical Exploratory Da...
research
06/27/2012

Inferring Latent Structure From Mixed Real and Categorical Relational Data

We consider analysis of relational data (a matrix), in which the rows co...
research
05/08/2023

Mixing time for uniform sampling of binary matrices with fixed row and column sums using the trade algorithm

The trade algorithm, which includes the curveball and fastball implement...
research
01/31/2018

Coupling geometry on binary bipartite networks: hypotheses testing on pattern geometry and nestedness

Upon a matrix representation of a binary bipartite network, via the perm...
research
05/21/2021

BELT: Block-wise Missing Embedding Learning Transformer

Matrix completion has attracted attention in many fields, including stat...

Please sign up or login with your details

Forgot password? Click here to reset