BERT based classification system for detecting rumours on Twitter

by Rini Anggrainingsih, et al.

The role of social media in opinion formation has far-reaching implications in all spheres of society. Although social media provide platforms for sharing news and views, it is hard to control the quality of posts because of the sheer volume of content on platforms like Twitter and Facebook. Misinformation and rumours have lasting effects on society: they influence people's opinions and may motivate people to act irrationally. It is therefore very important to detect and remove rumours from these platforms. The only way to prevent the spread of rumours is through automatic detection and classification of social media posts. Our focus in this paper is Twitter, as it is relatively easy to collect data from it. The majority of previous studies used supervised learning approaches to classify rumours on Twitter. These approaches rely on feature extraction to obtain both content and context features from the text of tweets to distinguish rumours from non-rumours. Manually extracting features, however, is time-consuming given the volume of tweets. We propose a novel approach to this problem: identifying rumours on Twitter with BERT sentence embeddings rather than the usual feature extraction techniques. BERT sentence embedding represents each tweet as a vector according to the contextual meaning of the tweet. We then classify those vectors as rumours or non-rumours using various supervised learning techniques. Our BERT-based models improved the accuracy by approximately 10
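The pipeline described in the abstract (embed each tweet with BERT, then classify the vectors) might be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how token outputs are pooled into a sentence vector or which classifiers were used, so the mean pooling, the `bert-base-uncased` checkpoint, the logistic regression classifier, and the helper names `mean_pool`, `embed_tweets`, and `train_classifier` are all hypothetical choices. It assumes `numpy`, `torch`, `transformers`, and `scikit-learn` are installed.

```python
from typing import Sequence

import numpy as np


def mean_pool(token_vectors: np.ndarray, attention_mask: np.ndarray) -> np.ndarray:
    """Average token vectors over non-padding positions, one vector per tweet.

    token_vectors: (batch, tokens, hidden); attention_mask: (batch, tokens).
    Mean pooling is an assumption; the abstract does not name a pooling method.
    """
    mask = attention_mask[..., None].astype(token_vectors.dtype)  # (batch, tokens, 1)
    summed = (token_vectors * mask).sum(axis=1)                   # ignore padded tokens
    counts = np.clip(mask.sum(axis=1), 1e-9, None)                # avoid division by zero
    return summed / counts


def embed_tweets(tweets: Sequence[str], model_name: str = "bert-base-uncased") -> np.ndarray:
    """Encode tweets with a pretrained BERT model (checkpoint name is an assumption)."""
    # Imported lazily so mean_pool can be used without these heavy dependencies.
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name)
    model.eval()

    batch = tokenizer(list(tweets), padding=True, truncation=True, return_tensors="pt")
    with torch.no_grad():
        output = model(**batch)
    return mean_pool(output.last_hidden_state.numpy(), batch["attention_mask"].numpy())


def train_classifier(vectors: np.ndarray, labels: Sequence[int]):
    """Fit one possible supervised classifier (an assumed choice) on tweet vectors."""
    from sklearn.linear_model import LogisticRegression

    classifier = LogisticRegression(max_iter=1000)
    classifier.fit(vectors, labels)
    return classifier
```

With labelled tweets available, usage would look like `clf = train_classifier(embed_tweets(texts), labels)` followed by `clf.predict(embed_tweets(new_texts))`, where label 1 could stand for rumour and 0 for non-rumour. Any other supervised model that accepts fixed-length vectors could replace the logistic regression.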




Machine Learning-based Approach for Depression Detection in Twitter Using Content and Activity Features

Social media channels, such as Facebook, Twitter, and Instagram, have al...

Keyphrase Extraction from Disaster-related Tweets

While keyphrase extraction has received considerable attention in recent...

A Study of Cyber Hate on Twitter with Implications for Social Media Governance Strategies

This paper explores ways in which the harmful effects of cyber hate may ...

Rumor Detection on Twitter Using Multiloss Hierarchical BiLSTM with an Attenuation Factor

Social media platforms such as Twitter have become a breeding ground for...

PACO: Provocation Involving Action, Culture, and Oppression

In India, people identify with a particular group based on certain attri...

Troll Tweet Detection Using Contextualized Word Representations

In recent years, numerous troll accounts that manipulate social media se...

Topic Modeling Based on Two-Step Flow Theory: Application to Tweets about Bitcoin

Digital cryptocurrencies such as Bitcoin have exploded in recent years i...
