Detect Hate Speech in Unseen Domains using Multi-Task Learning: A Case Study of Political Public Figures

08/22/2022
by   Lanqin Yuan, et al.
0

Automatic identification of hateful and abusive content is vital in combating the spread of harmful online content and its damaging effects. Most existing works evaluate models by examining the generalization error on train-test splits on hate speech datasets. These datasets often differ in their definitions and labeling criteria, leading to poor model performance when predicting across new domains and datasets. In this work, we propose a new Multi-task Learning (MTL) pipeline that utilizes MTL to train simultaneously across multiple hate speech datasets to construct a more encompassing classification model. We simulate evaluation on new previously unseen datasets by adopting a leave-one-out scheme in which we omit a target dataset from training and jointly train on the other datasets. Our results consistently outperform a large sample of existing work. We show strong results when examining generalization error in train-test splits and substantial improvements when predicting on previously unseen datasets. Furthermore, we assemble a novel dataset, dubbed PubFigs, focusing on the problematic speech of American Public Political Figures. We automatically detect problematic speech in the 305,235 tweets in PubFigs, and we uncover insights into the posting behaviors of public figures.

READ FULL TEXT
research
11/04/2021

InQSS: a speech intelligibility assessment model using a multi-task learning network

Speech intelligibility assessment models are essential tools for researc...
research
08/11/2022

MultiMatch: Multi-task Learning for Semi-supervised Domain Generalization

Domain generalization (DG) aims at learning a model on source domains to...
research
11/04/2021

Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning

Dexterous manipulation of arbitrary objects, a fundamental daily task fo...
research
08/19/2019

It Takes Nine to Smell a Rat: Neural Multi-Task Learning for Check-Worthiness Prediction

We propose a multi-task deep-learning approach for estimating the check-...
research
12/10/2021

MTLTS: A Multi-Task Framework To Obtain Trustworthy Summaries From Crisis-Related Microblogs

Occurrences of catastrophes such as natural or man-made disasters trigge...
research
07/20/2023

Pre-train, Adapt and Detect: Multi-Task Adapter Tuning for Camouflaged Object Detection

Camouflaged object detection (COD), aiming to segment camouflaged object...

Please sign up or login with your details

Forgot password? Click here to reset