Large-scale diversity estimation through surname origin inference

04/13/2018
by Antoine Mazières, et al.

The study of surnames as both linguistic and geographical markers of the past has proven valuable in several research fields, ranging from biology and genetics to demography and social mobility. This article builds upon the existing literature to conceive and develop a surname origin classifier based on a data-driven typology. The classifier lets us explore a methodology for producing large-scale estimates of the relative diversity of social groups, especially where such data is scarce. We then analyze the representativeness of surname origins for 15 socio-professional groups in France.


research
03/06/2020

A Framework for the Computational Linguistic Analysis of Dehumanization

Dehumanization is a pernicious psychological process that often leads to...
research
12/10/2018

Diversity of Artists in Major U.S. Museums

The U.S. art museum sector is grappling with diversity. While previous w...
research
05/14/2021

On Measuring the Diversity of Organizational Networks

The interaction patterns of employees in social and professional network...
research
07/14/2022

Origin of life from a maker's perspective – focus on protocellular compartments in bottom-up synthetic biology

The origin of life is shrouded in mystery, with few surviving clues, obs...
research
04/23/2021

A field guide to cultivating computational biology

Biomedical research centers can empower basic discovery and novel therap...
research
05/28/2020

A Corpus for Large-Scale Phonetic Typology

A major hurdle in data-driven research on typology is having sufficient ...
research
05/06/2021

Capturing the diversity of multilingual societies

Cultural diversity encoded within languages of the world is at risk, as ...
