Does Debiasing Inevitably Degrade the Model Performance

11/14/2022
by   Yiran Liu, et al.
0

Gender bias in language models has attracted sufficient attention because it threatens social justice. However, most of the current debiasing methods degraded the model's performance on other tasks while the degradation mechanism is still mysterious. We propose a theoretical framework explaining the three candidate mechanisms of the language model's gender bias. We use our theoretical framework to explain why the current debiasing methods cause performance degradation. We also discover a pathway through which debiasing will not degrade the model performance. We further develop a causality-detection fine-tuning approach to correct gender bias. The numerical experiment demonstrates that our method is able to lead to double dividends: partially mitigating gender bias while avoiding performance degradation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/22/2018

Reducing Gender Bias in Abusive Language Detection

Abusive language detection models tend to have a problem of being biased...
research
06/02/2020

A Multi-Task Comparator Framework for Kinship Verification

Approaches for kinship verification often rely on cosine distances betwe...
research
08/24/2022

Identifying and Overcoming Transformation Bias in Forecasting Models

Log and square root transformations of target variable are routinely use...
research
07/06/2022

Gender Biases and Where to Find Them: Exploring Gender Bias in Pre-Trained Transformer-based Language Models Using Movement Pruning

Language model debiasing has emerged as an important field of study in t...
research
12/08/2022

Implicit causality in GPT-2: a case study

This case study investigates the extent to which a language model (GPT-2...
research
04/08/2022

Fair and Argumentative Language Modeling for Computational Argumentation

Although much work in NLP has focused on measuring and mitigating stereo...
research
10/20/2022

Choose Your Lenses: Flaws in Gender Bias Evaluation

Considerable efforts to measure and mitigate gender bias in recent years...

Please sign up or login with your details

Forgot password? Click here to reset