Cross-Cultural Transfer Learning for Chinese Offensive Language Detection

03/31/2023
by Li Zhou, et al.

Detecting offensive language is a challenging task. Generalizing across different cultures and languages becomes even more challenging: besides lexical, syntactic and semantic differences, pragmatic aspects such as cultural norms and sensitivities, which are particularly relevant in this context, vary greatly. In this paper, we target Chinese offensive language detection and aim to investigate the impact of transfer learning using offensive language detection data from different cultural backgrounds, specifically Korean and English. We find that culture-specific biases in what is considered offensive negatively impact the transferability of language models (LMs) and that LMs trained on diverse cultural data are sensitive to different features in Chinese offensive language detection. In a few-shot learning scenario, however, our study shows promising prospects for non-English offensive language detection with limited resources. Our findings highlight the importance of cross-cultural transfer learning in improving offensive language detection and promoting inclusive digital spaces.
