Are We Consistently Biased? Multidimensional Analysis of Biases in Distributional Word Vectors

04/26/2019
by   Anne Lauscher, et al.
0

Word embeddings have recently been shown to reflect many of the pronounced societal biases (e.g., gender bias or racial bias). Existing studies are, however, limited in scope and do not investigate the consistency of biases across relevant dimensions like embedding models, types of texts, and different languages. In this work, we present a systematic study of biases encoded in distributional word vector spaces: we analyze how consistent the bias effects are across languages, corpora, and embedding models. Furthermore, we analyze the cross-lingual biases encoded in bilingual embedding spaces, indicative of the effects of bias transfer encompassed in cross-lingual transfer of NLP models. Our study yields some unexpected findings, e.g., that biases can be emphasized or downplayed by different embedding models or that user-generated content may be less biased than encyclopedic text. We hope our work catalyzes bias research in NLP and informs the development of bias reduction techniques.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2019

A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces

Distributional word vectors have recently been shown to encode many of t...
research
11/03/2020

AraWEAT: Multidimensional Analysis of Biases in Arabic Word Embeddings

Recent work has shown that distributional word vector spaces often encod...
research
05/23/2019

Fair is Better than Sensational:Man is to Doctor as Woman is to Doctor

Analogies such as man is to king as woman is to X are often used to illu...
research
03/11/2021

DebIE: A Platform for Implicit and Explicit Debiasing of Word Embedding Spaces

Recent research efforts in NLP have demonstrated that distributional wor...
research
03/09/2021

Bias and sensitivity analysis for unmeasured confounders in linear structural equation models

In this paper, we consider the extent of the biases that may arise when ...
research
05/21/2023

Measuring Intersectional Biases in Historical Documents

Data-driven analyses of biases in historical texts can help illuminate t...
research
03/01/2021

WordBias: An Interactive Visual Tool for Discovering Intersectional Biases Encoded in Word Embeddings

Intersectional bias is a bias caused by an overlap of multiple social fa...

Please sign up or login with your details

Forgot password? Click here to reset