Not to Overfit or Underfit? A Study of Domain Generalization in Question Answering

05/15/2022
by   Md Arafat Sultan, et al.
0

Machine learning models are prone to overfitting their source (training) distributions, which is commonly believed to be why they falter in novel target domains. Here we examine the contrasting view that multi-source domain generalization (DG) is in fact a problem of mitigating source domain underfitting: models not adequately learning the signal in their multi-domain training data. Experiments on a reading comprehension DG benchmark show that as a model gradually learns its source domains better – using known methods such as knowledge distillation from a larger model – its zero-shot out-of-domain accuracy improves at an even faster rate. Improved source domain learning also demonstrates superior generalization over three popular domain-invariant learning methods that aim to counter overfitting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2022

Task Transfer and Domain Adaptation for Zero-Shot Question Answering

Pretrained language models have shown success in various areas of natura...
research
04/04/2023

ERM++: An Improved Baseline for Domain Generalization

Multi-source Domain Generalization (DG) measures a classifier's ability ...
research
11/25/2019

Unsupervised Domain Adaptation of Language Models for Reading Comprehension

This study tackles unsupervised domain adaptation of reading comprehensi...
research
02/05/2023

Aggregation of Disentanglement: Reconsidering Domain Variations in Domain Generalization

Domain Generalization (DG) is a fundamental challenge for machine learni...
research
06/30/2021

Zero-Shot Estimation of Base Models' Weights in Ensemble of Machine Reading Comprehension Systems for Robust Generalization

One of the main challenges of the machine reading comprehension (MRC) mo...
research
09/28/2021

Single-dataset Experts for Multi-dataset Question Answering

Many datasets have been created for training reading comprehension model...
research
10/06/2021

Dynamically Decoding Source Domain Knowledge For Unseen Domain Generalization

Domain generalization is an important problem which has gain much attent...

Please sign up or login with your details

Forgot password? Click here to reset