KorQuAD1.0: Korean QA Dataset for Machine Reading Comprehension

09/16/2019
by   Seungyoung Lim, et al.
0

Machine Reading Comprehension (MRC) is a task that requires machine to understand natural language and answer questions by reading a document. It is the core of automatic response technology such as chatbots and automatized customer supporting systems. We present Korean Question Answering Dataset(KorQuAD), a large-scale Korean dataset for extractive machine reading comprehension task. It consists of 70,000+ human generated question-answer pairs on Korean Wikipedia articles. We release KorQuAD1.0 and launch a challenge at https://KorQuAD.github.io to encourage the development of multilingual natural language processing research.

READ FULL TEXT
research
01/05/2022

Multi Document Reading Comprehension

Reading Comprehension (RC) is a task of answering a question from a give...
research
02/13/2022

PQuAD: A Persian Question Answering Dataset

We present Persian Question Answering Dataset (PQuAD), a crowdsourced re...
research
05/19/2021

Sentence Extraction-Based Machine Reading Comprehension for Vietnamese

The development of Vietnamese language processing in general and machine...
research
11/02/2021

UQuAD1.0: Development of an Urdu Question Answering Training Data for Machine Reading Comprehension

In recent years, low-resource Machine Reading Comprehension (MRC) has ma...
research
04/02/2020

R3: A Reading Comprehension Benchmark Requiring Reasoning Processes

Existing question answering systems can only predict answers without exp...
research
06/22/2020

ReCO: A Large Scale Chinese Reading Comprehension Dataset on Opinion

This paper presents the ReCO, a human-curated ChineseReading Comprehensi...
research
08/27/2020

Relation/Entity-Centric Reading Comprehension

Constructing a machine that understands human language is one of the mos...

Please sign up or login with your details

Forgot password? Click here to reset