ModSandbox: Facilitating Online Community Moderation Through Error Prediction and Improvement of Automated Rules

10/18/2022
by   Jean Y. Song, et al.
0

Despite the common use of rule-based tools for online content moderation, human moderators still spend a lot of time monitoring them to ensure that they work as intended. Based on surveys and interviews with Reddit moderators who use AutoModerator, we identified the main challenges in reducing false positives and false negatives of automated rules: not being able to estimate the actual effect of a rule in advance and having difficulty figuring out how the rules should be updated. To address these issues, we built ModSandbox, a novel virtual sandbox system that detects possible false positives and false negatives of a rule to be improved and visualizes which part of the rule is causing issues. We conducted a user study with online content moderators, finding that ModSandbox can support quickly finding possible false positives and false negatives of automated rules and guide moderators to update those to reduce future errors.

READ FULL TEXT
research
10/24/2022

Optimal Decision Rules for the Discursive Dilemma

We study the classical discursive dilemma from the point of view of find...
research
10/20/2017

Solving the "false positives" problem in fraud prediction

In this paper, we present an automated feature engineering based approac...
research
03/04/2021

GAssert: A Fully Automated Tool to Improve Assertion Oracles

This demo presents the implementation and usage details of GASSERT, the ...
research
02/14/2020

ARMS: Automated rules management system for fraud detection

Fraud detection is essential in financial services, with the potential o...
research
08/24/2022

Graphical Models of False Information and Fact Checking Ecosystems

The wide spread of false information online including misinformation and...
research
06/25/2021

Improving Human Decisions by Adjusting the Alerting Thresholds for Computer Alerting Tools According to User and Task Characteristics

Objective: To investigate whether performance (number of correct decisio...

Please sign up or login with your details

Forgot password? Click here to reset