IndoNLI: A Natural Language Inference Dataset for Indonesian

10/27/2021
by   Rahmad Mahendra, et al.
0

We present IndoNLI, the first human-elicited NLI dataset for Indonesian. We adapt the data collection protocol for MNLI and collect nearly 18K sentence pairs annotated by crowd workers and experts. The expert-annotated data is used exclusively as a test set. It is designed to provide a challenging test-bed for Indonesian NLI by explicitly incorporating various linguistic phenomena such as numerical reasoning, structural changes, idioms, or temporal and spatial reasoning. Experiment results show that XLM-R outperforms other pre-trained models in our data. The best performance on the expert-annotated data is still far below human performance (13.4 is especially challenging. Furthermore, our analysis shows that our expert-annotated data is more diverse and contains fewer annotation artifacts than the crowd-annotated data. We hope this dataset can help accelerate progress in Indonesian NLP research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2020

OCNLI: Original Chinese Natural Language Inference

Despite the tremendous recent progress on natural language inference (NL...
research
05/05/2020

CODA-19: Reliably Annotating Research Aspects on 10,000+ CORD-19 Abstracts Using a Non-Expert Crowd

This paper introduces CODA-19, a human-annotated dataset that codes the ...
research
07/05/2023

SpaceNLI: Evaluating the Consistency of Predicting Inferences in Space

While many natural language inference (NLI) datasets target certain sema...
research
05/05/2020

CODA-19: Reliably Annotating Research Aspects on 10,000+ CORD-19 Abstracts Using Non-Expert Crowd

This paper introduces CODA-19, a human-annotated dataset that denotes th...
research
01/10/2018

Exploring Stereotypes and Biased Data with the Crowd

The goal of our research is to contribute information about how useful t...
research
10/13/2020

RuSemShift: a dataset of historical lexical semantic change in Russian

We present RuSemShift, a large-scale manually annotated test set for the...
research
10/16/2021

Analyzing Dynamic Adversarial Training Data in the Limit

To create models that are robust across a wide range of test inputs, tra...

Please sign up or login with your details

Forgot password? Click here to reset