NeMig – A Bilingual News Collection and Knowledge Graph about Migration

09/01/2023
by   Andreea Iana, et al.
0

News recommendation plays a critical role in shaping the public's worldviews through the way in which it filters and disseminates information about different topics. Given the crucial impact that media plays in opinion formation, especially for sensitive topics, understanding the effects of personalized recommendation beyond accuracy has become essential in today's digital society. In this work, we present NeMig, a bilingual news collection on the topic of migration, and corresponding rich user data. In comparison to existing news recommendation datasets, which comprise a large variety of monolingual news, NeMig covers articles on a single controversial topic, published in both Germany and the US. We annotate the sentiment polarization of the articles and the political leanings of the media outlets, in addition to extracting subtopics and named entities disambiguated through Wikidata. These features can be used to analyze the effects of algorithmic news curation beyond accuracy-based performance, such as recommender biases and the creation of filter bubbles. We construct domain-specific knowledge graphs from the news text and metadata, thus encoding knowledge-level connections between articles. Importantly, while existing datasets include only click behavior, we collect user socio-demographic and political information in addition to explicit click feedback. We demonstrate the utility of NeMig through experiments on the tasks of news recommenders benchmarking, analysis of biases in recommenders, and news trends analysis. NeMig aims to provide a useful resource for the news recommendation community and to foster interdisciplinary research into the multidimensional effects of algorithmic news curation.

READ FULL TEXT

page 7

page 15

research
03/11/2022

Towards Analyzing the Bias of News Recommender Systems Using Sentiment and Stance Detection

News recommender systems are used by online news providers to alleviate ...
research
03/04/2013

Personalized News Recommendation with Context Trees

The profusion of online news articles makes it difficult to find interes...
research
05/24/2019

Content based News Recommendation via Shortest Entity Distance over Knowledge Graphs

Content-based news recommendation systems need to recommend news article...
research
08/17/2018

Characterizing the public perception of WhatsApp through the lens of media

WhatsApp is, as of 2018, a significant component of the global informati...
research
05/27/2020

The POLUSA Dataset: 0.9M Political News Articles Balanced by Time and Outlet Popularity

News articles covering policy issues are an essential source of informat...
research
02/14/2020

A network perspective on intermedia agenda-setting

In Communication Theory, intermedia agenda-setting refers to the influen...
research
05/03/2020

Extracting Entities and Topics from News and Connecting Criminal Records

The goal of this paper is to summarize methodologies used in extracting ...

Please sign up or login with your details

Forgot password? Click here to reset