Using Noisy Self-Reports to Predict Twitter User Demographics

05/01/2020
by   Zach Wood-Doughty, et al.
0

Computational social science studies often contextualize content analysis within standard demographics. Since demographics are unavailable on many social media platforms (e.g. Twitter) numerous studies have inferred demographics automatically. Despite many studies presenting proof of concept inference of race and ethnicity, training of practical systems remains elusive since there are few annotated datasets. Existing datasets are small, inaccurate, or fail to cover the four most common racial and ethnic groups in the United States. We present a method to identify self-reports of race and ethnicity from Twitter profile descriptions. Despite errors inherent in automated supervision, we produce models with good performance when measured on gold standard self-report survey data. The result is a reproducible method for creating large-scale training resources for race and ethnicity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/30/2017

Demographics in Social Media Data for Public Health Research: Does it matter?

Social media data provides propitious opportunities for public health re...
research
03/30/2018

Characterizing Interconnections and Linguistic Patterns in Twitter

Social media is considered a democratic space in which people connect an...
research
05/13/2015

Reporting, Reviewing, and Responding to Harassment on Twitter

When people experience harassment online, from individual threats or inv...
research
03/22/2022

A Method for Estimating Individual Socioeconomic Status of Twitter Users

The rise of social media and computational social science (CSS) has open...
research
10/02/2019

Race and Religion in Online Abuse towards UK Politicians: Working Paper

Against a backdrop of tensions related to EU membership, we find levels ...
research
05/15/2019

Understanding the Radical Mind: Identifying Signals to Detect Extremist Content on Twitter

The Internet and, in particular, Online Social Networks have changed the...

Please sign up or login with your details

Forgot password? Click here to reset