NBIAS: A Natural Language Processing Framework for Bias Identification in Text

by Shaina Raza, et al.

Bias in textual data can lead to skewed interpretations and outcomes when the data is used. These biases can perpetuate stereotypes, discrimination, or other forms of unfair treatment. An algorithm trained on biased data may end up making decisions that disproportionately impact a certain group of people. Therefore, it is crucial to detect and remove these biases to ensure the fair and ethical use of data. To this end, we develop a comprehensive and robust framework, NBIAS, that consists of a data layer, a corpus construction layer, a model development layer, and an evaluation layer. The dataset is constructed by collecting diverse data from various fields, including social media, healthcare, and job hiring portals. We then apply a transformer-based token classification model that identifies biased words and phrases through a unique named entity. In the assessment procedure, we incorporate a blend of quantitative and qualitative evaluations to gauge the effectiveness of our models. We achieve accuracy improvements ranging from 1 baselines. We are also able to generate a robust understanding of how the model functions, capturing not only numerical data but also the quality and intricacies of its performance. The proposed approach is applicable to a variety of biases and contributes to the fair and ethical use of textual data.
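As an illustration of how token-level predictions can be turned into flagged phrases, here is a minimal sketch. It assumes a BIO tagging scheme with the hypothetical labels "B-BIAS", "I-BIAS", and "O"; the abstract does not name the labels. The function extract_bias_spans and the example sentence are also illustrative and not taken from the paper. The sketch covers only the decoding step after a token classifier has assigned one tag per token.

```python
from typing import List, Tuple


def extract_bias_spans(tokens: List[str], tags: List[str]) -> List[Tuple[int, int, str]]:
    """Group BIO tags into (start, end_exclusive, text) spans.

    Assumes the hypothetical labels "B-BIAS", "I-BIAS", and "O".
    """
    if len(tokens) != len(tags):
        raise ValueError("tokens and tags must have the same length")

    spans: List[Tuple[int, int, str]] = []
    start = None  # index where the currently open span began, if any

    for i, tag in enumerate(tags):
        if tag == "B-BIAS":
            # A new span begins; close any span that is still open.
            if start is not None:
                spans.append((start, i, " ".join(tokens[start:i])))
            start = i
        elif tag == "I-BIAS" and start is not None:
            # Continuation of the open span; nothing to do yet.
            continue
        else:
            # "O", or a stray "I-BIAS" with no open span: close if needed.
            if start is not None:
                spans.append((start, i, " ".join(tokens[start:i])))
                start = None

    # Close a span that runs to the end of the sentence.
    if start is not None:
        spans.append((start, len(tokens), " ".join(tokens[start:])))
    return spans


# Illustrative input: tags as a token classifier might emit them.
example_tokens = ["Older", "workers", "are", "too", "slow", "to", "learn"]
example_tags = ["O", "O", "O", "B-BIAS", "I-BIAS", "O", "O"]
example_spans = extract_bias_spans(example_tokens, example_tags)
print(example_spans)
```

In practice the tags would come from the trained model and the tokens from its tokenizer, so subword pieces would need to be merged back into words before this step.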




Identification of Bias Against People with Disabilities in Sentiment Analysis and Toxicity Detection Models

Sociodemographic biases are a common problem for natural language proces...

Addressing Biases in the Texts using an End-to-End Pipeline Approach

The concept of fairness is gaining popularity in academia and industry. ...

LOGAN: Local Group Bias Detection by Clustering

Machine learning techniques have been widely used in natural language pr...

Targeted Data Augmentation for bias mitigation

The development of fair and ethical AI systems requires careful consider...

Uncovering Bias in Personal Informatics

Personal informatics (PI) systems, powered by smartphones and wearables,...

Independent Ethical Assessment of Text Classification Models: A Hate Speech Detection Case Study

An independent ethical assessment of an artificial intelligence system i...

Fair Adversarial Networks

The influence of human judgement is ubiquitous in datasets used across t...
