NBIAS: A Natural Language Processing Framework for Bias Identification in Text

08/03/2023
by   Shaina Raza, et al.
0

Bias in textual data can lead to skewed interpretations and outcomes when the data is used. These biases could perpetuate stereotypes, discrimination, or other forms of unfair treatment. An algorithm trained on biased data ends up making decisions that disproportionately impact a certain group of people. Therefore, it is crucial to detect and remove these biases to ensure the fair and ethical use of data. To this end, we develop a comprehensive and robust framework Nbias that consists of a data layer, corpus contruction, model development layer and an evaluation layer. The dataset is constructed by collecting diverse data from various fields, including social media, healthcare, and job hiring portals. As such, we applied a transformer-based token classification model that is able to identify bias words/ phrases through a unique named entity. In the assessment procedure, we incorporate a blend of quantitative and qualitative evaluations to gauge the effectiveness of our models. We achieve accuracy improvements ranging from 1 baselines. We are also able to generate a robust understanding of the model functioning, capturing not only numerical data but also the quality and intricacies of its performance. The proposed approach is applicable to a variety of biases and contributes to the fair and ethical use of textual data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2021

Identification of Bias Against People with Disabilities in Sentiment Analysis and Toxicity Detection Models

Sociodemographic biases are a common problem for natural language proces...
research
03/13/2023

Addressing Biases in the Texts using an End-to-End Pipeline Approach

The concept of fairness is gaining popularity in academia and industry. ...
research
10/06/2020

LOGAN: Local Group Bias Detection by Clustering

Machine learning techniques have been widely used in natural language pr...
research
08/22/2023

Targeted Data Augmentation for bias mitigation

The development of fair and ethical AI systems requires careful consider...
research
03/27/2023

Uncovering Bias in Personal Informatics

Personal informatics (PI) systems, powered by smartphones and wearables,...
research
07/19/2021

Independent Ethical Assessment of Text Classification Models: A Hate Speech Detection Case Study

An independent ethical assessment of an artificial intelligence system i...
research
02/23/2020

Fair Adversarial Networks

The influence of human judgement is ubiquitous in datasets used across t...

Please sign up or login with your details

Forgot password? Click here to reset