Beyond Plain Toxic: Detection of Inappropriate Statements on Flammable Topics for the Russian Language

03/04/2022
by   Nikolay Babakov, et al.
4

Toxicity on the Internet, such as hate speech, offenses towards particular users or groups of people, or the use of obscene words, is an acknowledged problem. However, there also exist other types of inappropriate messages which are usually not viewed as toxic, e.g. as they do not contain explicit offences. Such messages can contain covered toxicity or generalizations, incite harmful actions (crime, suicide, drug use), provoke "heated" discussions. Such messages are often related to particular sensitive topics, e.g. on politics, sexual minorities, social injustice which more often than other topics, e.g. cars or computing, yield toxic emotional reactions. At the same time, clearly not all messages within such flammable topics are inappropriate. Towards this end, in this work, we present two text collections labelled according to binary notion of inapropriateness and a multinomial notion of sensitive topic. Assuming that the notion of inappropriateness is common among people of the same culture, we base our approach on human intuitive understanding of what is not acceptable and harmful. To objectivise the notion of inappropriateness, we define it in a data-driven way though crowdsourcing. Namely we run a large-scale annotation study asking workers if a given chatbot textual statement could harm reputation of a company created it. Acceptably high values of inter-annotator agreement suggest that the notion of inappropriateness exists and can be uniformly understood by different people. To define the notion of sensitive topics in an objective way we use on guidelines suggested commonly by specialists of legal and PR department of a large public company as potentially harmful.

READ FULL TEXT
research
03/09/2021

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company's Reputation

Not all topics are equally "flammable" in terms of toxicity: a calm disc...
research
01/27/2022

Learning Stance Embeddings from Signed Social Graphs

A key challenge in social network analysis is understanding the position...
research
02/07/2021

"Short is the Road that Leads from Fear to Hate": Fear Speech in Indian WhatsApp Groups

WhatsApp is the most popular messaging app in the world. Due to its popu...
research
02/01/2023

You Are What You Talk About: Inducing Evaluative Topics for Personality Analysis

Expressing attitude or stance toward entities and concepts is an integra...
research
12/03/2018

From the User to the Medium: Neural Profiling Across Web Communities

Online communities provide a unique way for individuals to access inform...
research
10/06/2020

Are Words Commensurate with Actions? Quantifying Commitment to a Cause from Online Public Messaging

Public entities such as companies and politicians increasingly use onlin...
research
06/17/2016

SMS Spam Filtering using Probabilistic Topic Modelling and Stacked Denoising Autoencoder

In This paper we present a novel approach to spam filtering and demonstr...

Please sign up or login with your details

Forgot password? Click here to reset