Measuring Harmful Representations in Scandinavian Language Models

11/21/2022
by   Samia Touileb, et al.
2

Scandinavian countries are perceived as role-models when it comes to gender equality. With the advent of pre-trained language models and their widespread usage, we investigate to what extent gender-based harmful and toxic content exist in selected Scandinavian language models. We examine nine models, covering Danish, Swedish, and Norwegian, by manually creating template-based sentences and probing the models for completion. We evaluate the completions using two methods for measuring harmful and toxic completions and provide a thorough analysis of the results. We show that Scandinavian pre-trained language models contain harmful and gender-based stereotypes with similar values across all languages. This finding goes against the general expectations related to gender equality in Scandinavian countries and shows the possible problematic outcomes of using such models in real-world settings.

READ FULL TEXT
research
04/12/2023

Measuring Normative and Descriptive Biases in Language Models Using Census Data

We investigate in this paper how distributions of occupations with respe...
research
12/05/2022

INCLUSIFY: A benchmark and a model for gender-inclusive German

Gender-inclusive language is important for achieving gender equality in ...
research
10/11/2021

Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting

Although pre-trained language models, such as BERT, achieve state-of-art...
research
04/20/2022

Analyzing Gender Representation in Multilingual Models

Multilingual language models were shown to allow for nontrivial transfer...
research
08/23/2021

For Better or for Worse? A Framework for Critical Analysis of ICT4D for Women

Diffusion of ICTs provide possibilities for women empowerment by greater...
research
07/17/2019

Leveraging Linguistic Characteristics for Bipolar Disorder Recognition with Gender Differences

Most previous studies on automatic recognition model for bipolar disorde...
research
05/20/2023

Learning Horn Envelopes via Queries from Large Language Models

We investigate an approach for extracting knowledge from trained neural ...

Please sign up or login with your details

Forgot password? Click here to reset