DiPD: Disruptive event Prediction Dataset from Twitter

11/25/2021
by Sanskar Soni, et al.

Riots and protests that get out of control can cause havoc in a country. Recent examples include the BLM movement, climate strikes, and the CAA movement, among many others, each of which caused large-scale disruption. Our motivation for creating this dataset is to support the development of machine learning systems that give their users insight into trending events and alert them to events that could lead to disruption in the nation. If an event is monitored as it develops, it can be handled and mitigated before the matter escalates. The dataset collects tweets about past or ongoing events known to have caused disruption and labels them as 1. We also collect tweets that are considered non-eventful and label them as 0, so that both classes can be used to train a classification system. The dataset contains 94855 records of unique events and 168706 records of unique non-events, for a total of 263561 records. We extract multiple features from the tweets, such as the user's follower count and the user's location, to understand the impact and reach of the tweets. This dataset may be useful for various event-related machine learning problems, such as event classification and event recognition.
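
The record counts quoted above imply an imbalanced binary classification problem. The following is a minimal sketch, not the authors' method, that checks the stated total and derives per-class fractions and inverse-frequency ("balanced") class weights, a common heuristic when one class is larger than the other. The function name `class_summary` and the choice of weighting scheme are assumptions made for illustration; only the two counts come from the abstract.

```python
# Record counts as reported in the abstract.
EVENT_COUNT = 94855       # tweets labelled 1 (disruptive events)
NON_EVENT_COUNT = 168706  # tweets labelled 0 (non-events)


def class_summary(counts):
    """Return (total, fractions, weights) for a {label: count} mapping.

    The weights use the inverse-frequency heuristic
    total / (n_classes * count); this is an illustrative choice,
    not something the abstract prescribes.
    """
    total = sum(counts.values())
    n_classes = len(counts)
    fractions = {label: n / total for label, n in counts.items()}
    weights = {label: total / (n_classes * n) for label, n in counts.items()}
    return total, fractions, weights


counts = {1: EVENT_COUNT, 0: NON_EVENT_COUNT}
total, fractions, weights = class_summary(counts)

print(f"total records: {total}")
for label in (1, 0):
    print(f"label {label}: fraction={fractions[label]:.4f}, "
          f"weight={weights[label]:.4f}")
```

Under these counts, event tweets make up roughly 36% of the records, so a classifier trained without any reweighting or resampling may lean toward the non-event class.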

research
10/01/2020

Event Detection in Twitter by Weighting Tweet's Features

In recent years, people spend a lot of time on social networks. They use...
research
02/01/2021

Understanding collective human movement dynamics during large-scale events using big geosocial data analytics

With the rapid advancement of information and communication technologies...
research
01/14/2021

On Informative Tweet Identification For Tracking Mass Events

Twitter has been heavily used as an important channel for communicating ...
research
01/05/2020

On Identifying Hashtags in Disaster Twitter Data

Tweet hashtags have the potential to improve the search for information ...
research
01/24/2019

Location reference identification from tweets during emergencies: A deep learning approach

Twitter is recently being used during crises to communicate with officia...
research
05/28/2017

A Deep Multi-View Learning Framework for City Event Extraction from Twitter Data Streams

Cities have been a thriving place for citizens over the centuries due to...
research
04/25/2016

Towards Real-Time, Country-Level Location Classification of Worldwide Tweets

In contrast to much previous work that has focused on location classific...
