What Can We Learn From Almost a Decade of Food Tweets

07/10/2020
by Uga Sproģis, et al.

We present the Latvian Twitter Eater Corpus, a set of tweets in the narrow domain of food, drinks, eating and drinking. The corpus was collected over a time span of more than 8 years and includes over 2 million tweets accompanied by additional useful data. We also extract two sub-corpora: question-and-answer tweets and sentiment-annotated tweets. We analyse the contents of the corpus and demonstrate use cases for the sub-corpora by training domain-specific question-answering and sentiment-analysis models on data from the corpus.

research
06/09/2020

EPIC: An Epidemics Corpus Of Over 20 Million Relevant Tweets

Since the start of COVID-19, several relevant corpora from various sourc...
research
06/09/2020

EPIC30M: An Epidemics Corpus Of Over 30 Million Relevant Tweets

Since the start of COVID-19, several relevant corpora from various sourc...
research
10/23/2018

TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets

Publicly available social media archives facilitate research in a variet...
research
02/13/2019

Predicting US State-Level Agricultural Sentiment as a Measure of Food Security with Tweets from Farming Communities

The ability to obtain accurate food security metrics in developing areas...
research
06/09/2021

Fragmented and Valuable: Following Sentiment Changes in Food Tweets

We analysed sentiment and frequencies related to smell, taste and temper...
research
02/26/2018

Publishing a Quality Context-aware Annotated Corpus and Lexicon for Harassment Research

Having a quality annotated corpus is essential especially for applied re...
