A large scale lexical and semantic analysis of Spanish language variations in Twitter

10/12/2021
by   Eric S. Tellez, et al.
26

Dialectometry is a discipline devoted to studying the variations of a language around a geographical region. One of their goals is the creation of linguistic atlases capturing the similarities and differences of the language under study around the area in question. For instance, Spanish is one of the most spoken languages across the world, but not necessarily Spanish is written and spoken in the same way in different countries. This manuscript presents a broad analysis describing lexical and semantic relationships among 26 Spanish-speaking countries around the globe. For this study, we analyze four-year of the Twitter geotagged public stream to provide an extensive survey of the Spanish language vocabularies of different countries, its distributions, semantic usage of terms, and emojis. We also offer open regional word-embedding resources for Spanish Twitter to help other researchers and practitioners take advantage of regionalized models.

READ FULL TEXT

page 12

page 17

research
11/16/2015

Learning about Spanish dialects through Twitter

This paper maps the large-scale variation of the Spanish language by emp...
research
10/18/2020

UoB at SemEval-2020 Task 1: Automatic Identification of Novel Word Senses

Much as the social landscape in which languages are spoken shifts, langu...
research
10/22/2015

Freshman or Fresher? Quantifying the Geographic Variation of Internet Language

We present a new computational technique to detect and analyze statistic...
research
07/26/2014

Crowdsourcing Dialect Characterization through Twitter

We perform a large-scale analysis of language diatopic variation using g...
research
07/02/2022

Language statistics at different spatial, temporal, and grammatical scales

Statistical linguistics has advanced considerably in recent decades as d...
research
05/23/2018

Grounding the Semantics of Part-of-Day Nouns Worldwide using Twitter

The usage of part-of-day nouns, such as 'night', and their time-specific...
research
11/24/2021

Third-party Service Dependencies and Centralization Around the World

There is a growing concern about consolidation trends in Internet servic...

Please sign up or login with your details

Forgot password? Click here to reset