Modern Question Answering Datasets and Benchmarks: A Survey

by Zhen Wang, et al.
Delft University of Technology

Question Answering (QA) is one of the most important natural language processing (NLP) tasks. It aims to use NLP technologies to generate an answer to a given question from a massive unstructured corpus. With the development of deep learning, increasingly challenging QA datasets are being proposed, and many new methods for solving them are emerging. In this paper, we survey influential QA datasets released in the era of deep learning. Specifically, we first introduce two of the most common QA tasks, textual question answering and visual question answering, covering the most representative datasets for each, and then discuss some current challenges of QA research.


Towards Deconfounding the Influence of Subject's Demographic Characteristics in Question Answering

Question Answering (QA) tasks are used as benchmarks of general machine ...

emrQA: A Large Corpus for Question Answering on Electronic Medical Records

We propose a novel methodology to generate domain-specific large-scale q...

Learning to Paraphrase for Question Answering

Question answering (QA) systems are sensitive to the many different ways...

Incidental Supervision from Question-Answering Signals

Human annotations are costly for many natural language processing (NLP) ...

Retrieving and Reading: A Comprehensive Survey on Open-domain Question Answering

Open-domain Question Answering (OpenQA) is an important task in Natural ...

AI-based Question Answering Assistance for Analyzing Natural-language Requirements

By virtue of being prevalently written in natural language (NL), require...
