Extending the Scope of Out-of-Domain: Examining QA models in multiple subdomains

04/09/2022
by   Chenyang Lyu, et al.
0

Past works that investigate out-of-domain performance of QA systems have mainly focused on general domains (e.g. news domain, wikipedia domain), underestimating the importance of subdomains defined by the internal characteristics of QA datasets. In this paper, we extend the scope of "out-of-domain" by splitting QA examples into different subdomains according to their several internal characteristics including question type, text length, answer position. We then examine the performance of QA systems trained on the data from different subdomains. Experimental results show that the performance of QA systems can be significantly reduced when the train data and test data come from different subdomains. These results question the generalizability of current QA systems in multiple subdomains, suggesting the need to combat the bias introduced by the internal characteristics of QA datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2021

Towards Deconfounding the Influence of Subject's Demographic Characteristics in Question Answering

Question Answering (QA) tasks are used as benchmarks of general machine ...
research
06/16/2020

Selective Question Answering under Domain Shift

To avoid giving wrong answers, question answering (QA) models need to kn...
research
05/05/2020

MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models

Retrieval question answering (ReQA) is the task of retrieving a sentence...
research
04/17/2022

WikiOmnia: generative QA corpus on the whole Russian Wikipedia

The General QA field has been developing the methodology referencing the...
research
09/11/2018

Does it care what you asked? Understanding Importance of Verbs in Deep Learning QA System

In this paper we present the results of an investigation of the importan...
research
04/29/2020

SubjQA: A Dataset for Subjectivity and Review Comprehension

Subjectivity is the expression of internal opinions or beliefs which can...
research
08/30/2023

Knowing Your Annotator: Rapidly Testing the Reliability of Affect Annotation

The laborious and costly nature of affect annotation is a key detrimenta...

Please sign up or login with your details

Forgot password? Click here to reset