TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets

10/23/2018
by   Pavlos Fafalios, et al.
0

Publicly available social media archives facilitate research in a variety of fields, such as data science, sociology or the digital humanities, where Twitter has emerged as one of the most prominent sources. However, obtaining, archiving and annotating large amounts of tweets is costly. In this paper, we describe TweetsKB, a publicly available corpus of currently more than 1.5 billion tweets, spanning almost 5 years (Jan'13-Nov'17). Metadata information about the tweets as well as extracted entities, hashtags, user mentions and sentiment information are exposed using established RDF/S vocabularies. Next to a description of the extraction and annotation process, we present use cases to illustrate scenarios for entity-centric information exploration, data integration and knowledge discovery facilitated by TweetsKB.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2020

TweetsCOV19 – A Knowledge Base of Semantically Annotated Tweets about the COVID-19 Pandemic

Publicly available social media archives facilitate research in the soci...
research
03/31/2020

A large-scale Twitter dataset for drug safety applications mined from publicly existing resources

With the increase in popularity of deep learning models for natural lang...
research
07/10/2020

What Can We Learn From Almost a Decade of Food Tweets

We present the Latvian Twitter Eater Corpus - a set of tweets in the nar...
research
07/27/2021

A Biomedically oriented automatically annotated Twitter COVID-19 Dataset

The use of social media data, like Twitter, for biomedical research has ...
research
04/11/2021

NorDial: A Preliminary Corpus of Written Norwegian Dialect Use

Norway has a large amount of dialectal variation, as well as a general t...
research
02/01/2021

Understanding collective human movement dynamics during large-scale events using big geosocial data analytics

With the rapid advancement of information and communication technologies...
research
03/23/2018

Stance Detection on Tweets: An SVM-based Approach

Stance detection is a subproblem of sentiment analysis where the stance ...

Please sign up or login with your details

Forgot password? Click here to reset