Developing and Evaluating Tiny to Medium-Sized Turkish BERT Models

07/26/2023
by Himmet Toprak Kesgin, et al.

This study introduces and evaluates tiny, mini, small, and medium-sized uncased Turkish BERT models, aiming to bridge the research gap in less-resourced languages. We trained these models on a diverse dataset encompassing over 75GB of text from multiple sources and tested them on several tasks, including mask prediction, sentiment analysis, news classification, and zero-shot classification. Despite their smaller size, our models exhibited robust performance, including on the zero-shot task, while offering computational efficiency and faster execution times. Our findings provide valuable insights into the development and application of smaller language models, especially in the context of the Turkish language.

