A Unified System for Aggression Identification in English Code-Mixed and Uni-Lingual Texts

01/15/2020
by   Anant Khandelwal, et al.

Wide usage of social media platforms has increased the risk of aggression, which causes mental stress and negatively affects people's lives through psychological agony, fighting behavior, and disrespect toward others. The majority of such conversations contain code-mixed languages [28]. Additionally, the communication style used to express thoughts changes from one social media platform to another (e.g., communication styles differ between Twitter and Facebook). All of these factors increase the complexity of the problem. To address them, we introduce a unified and robust multi-modal deep learning architecture that works for both English code-mixed and uni-lingual English datasets. The devised system uses psycho-linguistic features and very basic linguistic features. Our multi-modal deep learning architecture contains a Deep Pyramid CNN, a Pooled BiLSTM, and a Disconnected RNN (with both GloVe and FastText embeddings). Finally, the system makes its decision based on model averaging. We evaluated our system on the English code-mixed TRAC 2018 dataset and a uni-lingual English dataset obtained from Kaggle. Experimental results show that our proposed system outperforms all previous approaches on both the English code-mixed dataset and the uni-lingual English dataset.
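The model-averaging step mentioned in the abstract can be illustrated with a minimal sketch. It assumes each model outputs a probability distribution over three aggression classes, and that the final decision is the class with the highest mean probability. The label set, model names, and probability values below are illustrative assumptions, not details taken from the paper.

```python
from typing import Dict, List

# Assumed label set (non-aggressive, covertly aggressive, overtly aggressive);
# the abstract itself does not list the classes.
LABELS = ["NAG", "CAG", "OAG"]


def average_predictions(model_probs: Dict[str, List[float]]) -> List[float]:
    """Return the unweighted mean of per-model class probabilities."""
    if not model_probs:
        raise ValueError("at least one model prediction is required")
    totals = [0.0] * len(LABELS)
    for name, probs in model_probs.items():
        if len(probs) != len(LABELS):
            raise ValueError(f"model {name!r} returned {len(probs)} scores")
        for i, p in enumerate(probs):
            totals[i] += p
    return [t / len(model_probs) for t in totals]


def predict_label(model_probs: Dict[str, List[float]]) -> str:
    """Pick the label whose averaged probability is highest."""
    avg = average_predictions(model_probs)
    best = max(range(len(avg)), key=avg.__getitem__)
    return LABELS[best]


# Hypothetical outputs for one post; names and numbers are made up.
example = {
    "dpcnn": [0.2, 0.5, 0.3],
    "pooled_bilstm": [0.1, 0.6, 0.3],
    "drnn_glove": [0.3, 0.3, 0.4],
    "drnn_fasttext": [0.2, 0.4, 0.4],
}

if __name__ == "__main__":
    print(average_predictions(example))
    print(predict_label(example))
```

Unweighted averaging is the simplest variant; a weighted average, with weights tuned on validation data, would be a natural alternative if the component models differ in reliability.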

research
01/15/2020

AggressionNet: Generalised Multi-Modal Deep Temporal and Sequential Learning for Aggression Identification

Wide usage of social media platforms has increased the risk of aggressio...
research
02/01/2017

SMPOST: Parts of Speech Tagger for Code-Mixed Indic Social Media Text

Use of social media has grown dramatically during the last few years. Us...
research
03/31/2021

Misinformation detection in Luganda-English code-mixed social media text

The increasing occurrence, forms, and negative effects of misinformation...
research
03/26/2018

Aggression-annotated Corpus of Hindi-English Code-mixed Data

As the interaction over the web has increased, incidents of aggression a...
research
10/17/2020

CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020: Identifying Offensive Language from Manglish Tweets

With the popularity of social media, communications through blogs, Faceb...
research
03/25/2022

L3Cube-MahaHate: A Tweet-based Marathi Hate Speech Detection Dataset and BERT models

Social media platforms are used by a large number of people prominently ...
research
12/30/2019

"Hinglish" Language – Modeling a Messy Code-Mixed Language

With a sharp rise in fluency and users of "Hinglish" in linguistically d...
